MakeThisVid generates video from a text prompt — describe a scene in natural language, and the AI renders a 6-second 1080p video with audio in 45–90 seconds. No image upload required; text is the only input. Plans from $19.99/mo; starter pack $2.99 for one video.
AI Video Generator from Text: Type a Prompt, Get a Video
A text prompt is the most expressive input for AI video. No product photo required. No reference image. Just words: describe the scene, the subject, the movement, the light, the mood — and the AI renders it. MakeThisVid's text-to-video generation turns a sentence into a 6-second video clip with audio. You're not choosing from a template library or assembling existing stock footage. You're generating original video from a description that doesn't exist yet. The output is a clean 720p or 1080p MP4, watermark-free, commercial use included — ready for ads, social content, or client delivery. Text input works best when you're precise. The AI interprets your description literally, so the more specific the scene details, the more intentional the output. This page walks through how to write prompts that produce usable results.
How to Use MakeThisVid
From prompt to downloadable MP4, ready to deploy.
-
Open the Create page in Text → Video mode
Go to makethisvid.com/create. Text → Video is the default mode. A prompt field accepts your scene description in natural language — no special syntax required.
-
Write a specific scene description
Effective text prompts describe: (1) the subject — what or who is in the frame; (2) the environment — surface, background, setting; (3) the lighting — natural, studio, dramatic, soft; (4) the camera movement — push-in, orbit, static, dolly; (5) the mood or aesthetic — cinematic, vibrant, minimal, gritty. All five in one prompt produces the most controllable results.
-
Iterate on the prompt
Text-to-video is iterative. Your first generation reveals how the AI interpreted your description. Adjust specifics — add a camera movement if the shot was static, specify lighting if it was flat, add texture words if the scene felt generic. Each generation is 1–2 credits; the cost to iterate is low.
-
Select resolution and generate
720p (1 credit) for early-stage tests. 1080p (2 credits) for production-quality output. Click Generate. The AI renders the video in 45–90 seconds. Credits refund automatically on failures.
-
Download and use the video
Download the MP4 — audio is included, watermark-free, commercial use licensed. Post it, run it as an ad, send it to a client, or use it as a visual asset in any medium.
Who Uses MakeThisVid for This
Ad creative without product assets
Don't have product photos or video footage? Generate the ad entirely from text. Describe the product, its setting, and the visual treatment — the AI renders the scene from scratch.
Concept validation before production
Generate a text-to-video rough of a campaign concept before committing to a shoot. Show clients or stakeholders what the visual direction looks like — at the cost of a few credits.
Abstract or atmospheric content
Some of the strongest text-to-video outputs are abstract — light effects, color gradients, nature scenes, mood sequences. These don't require product photography and are particularly effective as brand-feel or ambient social content.
Frequently Asked Questions
Related
Type a scene. Get a video.
Describe the product, setting, and mood — AI renders a 6-second 1080p clip with audio in under 2 minutes.
Try MakeThisVid