MakeThisVid and D-ID solve different AI video problems. MakeThisVid generates original AI video scenes from a text prompt or product photo — synthesized footage with ambient audio, built for ads and social content. D-ID specializes in AI talking portrait videos — animating a still photo of a person to lip-sync to a script or audio track. D-ID also offers a generative avatar presenter feature.

MakeThisVid vs D-ID: Scene Generation vs AI Talking Portrait Video

<p>D-ID is best known for its core technology: taking a still photograph of a person and animating it to lip-sync to an audio recording or script — the talking portrait. This capability is used in digital memorial applications, personalized video creation, and avatar presenter videos for marketing and training. D-ID also offers broader AI avatar presenter functionality similar to other platforms in that category.</p><p>MakeThisVid does something different: it generates original video scenes — synthesized footage of environments, products, and atmospheric moments from a text description or photo. No portraits, no lip-sync, no avatars. The output is visual scene content, built for paid ads and social media where cinematic footage is the creative format.</p>

How to Use MakeThisVid

From prompt to downloadable MP4, ready to deploy.

  1. Core technology difference

    D-ID's core technology is photo animation — taking a still image of a person and animating it to lip-sync to audio. MakeThisVid generates AI video scenes from text descriptions or product photos — synthesized footage of places, objects, and moments. Fundamentally different capabilities.

  2. Pricing comparison

    MakeThisVid: Lite ($14.99/mo, 20 credits), Standard ($29.99/mo, 50 credits), Pro ($79.99/mo, 200 credits). D-ID plans start at $5.99/mo (Lite, 10 credits) and scale to $49.99/mo+ for higher volumes. D-ID's credit system maps to video minutes.

  3. Talking portrait vs scene footage

    D-ID's output is a moving, speaking portrait of a person — realistic lip-sync animation on a still photo. MakeThisVid's output is a synthesized scene — an environment or product moment in motion. Neither tool can produce what the other does.

  4. Personalized video use case

    D-ID's photo-to-video technology enables personalized video at scale — creating individual video messages using a speaker's photo and a personalized script. This is a sales and marketing use case (video prospecting, personalized outreach) that MakeThisVid doesn't support.

  5. Advertising and social media creative

    MakeThisVid's 8-second scene clips are purpose-built for paid ad placements and social media creative. D-ID's output — talking portraits and avatar presenters — is better suited for personalized video messages, training content, and explainer ads featuring a presenter.

Who Uses MakeThisVid for This

MakeThisVid for product and scene-based ad creative

Original synthesized footage of products, environments, and lifestyle moments — for TikTok, Instagram, and Meta paid ads. Visual-first creative where the scene does the work.

D-ID for personalized video outreach

Sales teams using D-ID to send personalized video messages where a presenter's photo is animated with a custom script for each prospect. This is a specific sales enablement use case unique to D-ID's core technology.

D-ID for talking portrait content

Animating historical photos, creating digital memorial videos, or producing avatar presenter content from a still image. D-ID's talking portrait capability has applications across entertainment, education, and enterprise communications.

Frequently Asked Questions

D-ID's primary capability is animating still photos of people to lip-sync with audio, and avatar presenter videos. It does not synthesize environmental scenes or product footage from visual descriptions the way MakeThisVid does.
No. MakeThisVid generates AI scene videos from descriptions and product photos — it cannot animate a portrait to lip-sync with audio. D-ID's talking portrait technology is unique to that platform.
MakeThisVid is better suited for product scene ads — cinematic product footage from a photo or description. D-ID could work for presenter-style product walkthrough videos, but that's a more complex production workflow than MakeThisVid's photo-to-video mode.
D-ID has offered a free trial with limited credits. MakeThisVid has no free tier — credits are required for all renders, with automatic refunds on failed renders.
MakeThisVid's 8-second cinematic clips are purpose-built for short-form paid ad placements. D-ID's talking portrait and avatar content can work in certain ad formats but is not optimized for the visual-first, short-form ad creative that performs best on TikTok and Instagram Reels.

Generate original AI video scenes

Describe the moment or drop a product photo. 45 seconds to a downloadable 1080p MP4 — audio included, commercial use licensed.

Try MakeThisVid