MakeThisVid and Fliki are both AI video tools, but they solve different problems. MakeThisVid generates original AI video scenes from a text prompt or product photo — synthesized footage with AI audio, built for short-form ads and social content. Fliki creates narrated videos by pairing text-to-speech voiceover with stock footage and AI-generated images — suited for content creators who need talking-point videos with narration.
MakeThisVid vs Fliki: Scene Generation vs Text-to-Speech Video
<p>MakeThisVid and Fliki both produce video from text inputs, but the type of output is fundamentally different. MakeThisVid synthesizes original cinematic scenes — if you describe a product in motion or a lifestyle moment, the AI generates that footage from scratch, with ambient AI audio. The output looks like filmed footage, not a slideshow.</p><p>Fliki takes a different approach: you write or paste a script, and Fliki generates a narrated video by pairing a text-to-speech voiceover with relevant stock footage clips or AI-generated images assembled into a slide-style presentation. The result is a narrated explainer or content video — clear, functional, and efficient for talking-point content. Here's how to choose between them.</p>
How to Use MakeThisVid
From prompt to downloadable MP4, ready to deploy.
-
Core output format is different
MakeThisVid generates original AI-synthesized video scenes — footage that looks like it was filmed, with ambient audio. Fliki generates narrated slide-style videos combining voiceover with stock footage clips or AI images — output that looks like an explainer video or a YouTube content video.
-
Pricing comparison
MakeThisVid: Lite ($14.99/mo, 20 credits), Standard ($29.99/mo, 50 credits), Pro ($79.99/mo, 200 credits). Fliki plans start at $28/mo (Standard, 120 minutes/mo) and go up to $88/mo+ for higher output volumes. Fliki is priced by minutes of video; MakeThisVid by credits per clip.
-
Input type and creative process
Fliki is script-first: paste your talking points and it builds around narration. MakeThisVid is visual-first: describe the scene and it generates the footage. Both accept text, but the creative direction is different — narration-led vs visual-led.
-
Use in paid advertising
MakeThisVid's 8-second cinematic clips are built for paid ad placements where visual impact drives the click. Fliki's narrated videos are better suited for YouTube content, explainer ads where narration carries the message, and social content that needs a clear spoken point.
-
Stock footage vs original generation
Fliki's video layer relies on stock footage clips and AI image generation. MakeThisVid generates all footage from scratch — there's no stock library involved. This means MakeThisVid's output is more unique but requires more descriptive prompting; Fliki's output is more predictable but can look stock-library-sourced.
Who Uses MakeThisVid for This
MakeThisVid for visual-first ad campaigns
Product ads, lifestyle moments, and brand atmosphere content — MakeThisVid generates original footage from descriptions or photos. Runs natively as a TikTok, Meta, or YouTube short ad without narration.
Fliki for narrated explainer content
Product explainers, how-to videos, and talking-point content — Fliki's text-to-speech narration with accompanying visuals is efficient for content creators who need clear spoken-word delivery with supporting visuals.
Fliki for high-volume content production
Fliki's minutes-based model suits high-volume content creators who need to produce many narrated videos efficiently. If you're producing 10+ narrated explainer videos per month, Fliki's model may be more cost-efficient for that specific format.
Frequently Asked Questions
Related
Generate original AI video scenes
Describe the moment or drop a product photo. 45 seconds to a downloadable 1080p MP4 — audio included, commercial use licensed.
Try MakeThisVid