🐎 Coming soonHappy Horse 1.0 by Alibaba · #1 on Artificial Analysis · open-source · API landing late April
VideoGenAI
← All posts
How-toΒ·March 28, 2026Β·9 min read

Prompting for cinematic video: 7 tricks stolen from DPs

Real cinematographers don't say 'hyperrealistic 8K unreal engine'. Here are seven concrete prompting techniques we've lifted from DPs β€” and the before/after they make on your output.

VideoGenAI Team
Creative & content
!TL;DR
β†’Good prompts read like camera direction, not a list of buzzwords.
β†’Lead with lens and motion. Subject comes *after*.
β†’One carefully-chosen reference word beats a pile of vague ones.
β†’"No X, no Y" at the end saves you a post pass.

The best AI prompts look almost nothing like the ones trending on social. Forget "hyperrealistic, 8K, unreal engine, intricate details, masterpiece". Real cinematographers don't talk like that. And β€” it turns out β€” modern video models don't listen to that either.

We spent a week with a working DP, rewrote 40 prompts using her vocabulary, and re-ran every one through our pipeline. Every single output was better. Not *differently weighted* β€” unambiguously better. Here's what she told us.

1. Lead with the camera, not the subject

"Handheld, 35mm, slow push-in on β€”" is better than "a man sitting β€”". Give the model a lens, a motion, and a framing *first*. The subject comes after, because the camera is what the model is actually simulating.

Prompt pattern
Output character
Subject first ("a woman walking down a street")
Flat, stock-footage look
Camera first ("handheld, 35mm, tracking a woman β€”")
Intentional, cinematic, motion-aware

2. Name the light source

"Golden-hour sun from frame right" or "soft key, hard backlight" gives the model a real physical scene to reason about. Light isn't decoration β€” it's half the image.

β˜…Pro tip

If you only add one thing to your existing prompts this week, add the light. You will see the difference on clip #1.

3. Specify the crop

Close-up, medium, wide, aerial. The model picks the wrong one surprisingly often if you don't say.

βœ“Close-up β€” face or detail fills the frame
βœ“Medium β€” waist-up, conversational
βœ“Wide β€” establishing, subject reads as small
βœ“Aerial / top-down β€” strong graphic shape
"Good composition" β€” means nothing to the model, skip it

4. Describe the air

Dust motes. Haze. Rain. Snow. Fog at ankle-height. Smoke rolling over a shoulder. These aren't decoration β€” they *tell the engine where to put depth*. A "cinematic" shot without atmospheric stuff in it is a flat shot. The air between the subject and the camera is what makes it feel like film.

β€œ
If you can see the air, you can feel the room.
β€” An old DP quote, still the best one

5. Use one reference word, not a list

"Fincher-dark" beats "moody, gritty, cinematic, atmospheric, dramatic". Lists of adjectives flatten each other β€” the model averages them and you end up with the blandest thing that satisfies all five.

Diffuse list
Single reference
moody, gritty, cinematic, atmospheric, dramatic
Fincher-dark
warm, happy, bright, sunny, cheerful
Wes-Anderson-warm
sharp, clean, product-like, modern
Apple-keynote-clean

Pick the one word that carries the visual culture, and let it do the work.

6. Skip the "masterpiece, trending, 8K" junk

They don't help modern video models. They used to help older image models on Civitai three years ago. Different era. Including them now is noise β€” every token you spend on junk is a token not spent on something useful.

!Heads up

This is the single most common mistake we see in prompts pasted from Discord. Copy-pasting 2022-era image prompts into 2026 video models genuinely makes them worse, not better.

7. End with what the shot *isn't*

"No text, no watermark, no people" is faster than trying to coax them out in post. Negatives are cheap, post is expensive.

βœ“No text / no captions
βœ“No watermarks / no logos
βœ“No visible crew or equipment
βœ“No subtitles or graphics burned in

Putting it together

A full "DP-style" prompt looks like this:

> *Handheld, 35mm, slow push-in on a woman tying her running shoes on a wet concrete step. Pre-dawn blue hour, practical streetlamp key from frame left, soft fog between camera and subject. Medium close-up. Fincher-dark. No text, no watermark.*

One lens, one motion, one light source, one atmosphere word, one reference, one framing, one exclusion list. That's the whole technique.

Paste one of your old prompts, rewrite it in DP form, run both on your trial tokens. Keep whichever you'd actually ship.

Try it on your next clip

Keep reading