Video generation

Text- and image-to-video diffusion. The hosted models lead on length, motion and consistency; open models (HunyuanVideo, Wan, LTX-Video, Mochi) bring generation to local GPUs, increasingly in real time on consumer cards.

Providers

The leading hosted services — sign up and use them via app or API.

| Provider | From | Strengths | Access |
| --- | --- | --- | --- |
| Sora 2 | OpenAI | Long, coherent shots; audio | App · API |
| Veo 3 | Google | High fidelity, native audio | App · API |
| Gen-4 | Runway | Creative control, editing suite | App · API |
| Dream Machine | Luma | Fast, expressive motion | App · API |
| Kling | Kuaishou | Strong motion & realism | App · API |
| Seedance | ByteDance | Multi-shot, fast | API |
| Hailuo | MiniMax | Cinematic, cheap | App · API |

Open-source tools

Run these yourself on a local or rented GPU. Open weights are free to download, keep your prompts and outputs on your own hardware, and can be fine-tuned; license terms vary by model.

HunyuanVideo: Tencent's open 13B video model, with strong open-weight quality.

model · open

Wan: Alibaba's open text/image-to-video family, widely used in ComfyUI.

model · open

LTX-Video: Lightricks' fast open model, near real-time on a single GPU.

model · fast

Mochi: Genmo's open, high-motion diffusion model.

model · open

Zhipu/THUDM open text-to-video models in several sizes.

model · open

An open reproduction of the Sora recipe, training code included.

model · open

Real-time autoregressive video generation research + code.

realtime · research

The same node UI runs most open video models end-to-end.

UI

What you need to run it

See GPU prices to buy a card, hosting to rent one by the hour, and GPU programming to understand the libraries underneath. VRAM is the deciding factor — check each tool's model card for its memory needs.
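
As a rough first check before reading a model card, the memory needed just to hold a model's weights can be estimated as parameter count times bytes per parameter. The sketch below is a back-of-envelope calculation, not a measurement: the function names, the `headroom` factor of 0.8, and the dtype table are illustrative assumptions, and the estimate ignores activations, the VAE, and the text encoder, all of which add to real usage. The 13B figure is the one quoted above for Tencent's model.

```python
# Back-of-envelope VRAM estimate: weights only.
# Real usage is higher (activations, VAE, text encoder, framework overhead).

# Bytes stored per parameter for common precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "fp16": 2.0, "fp8": 1.0, "int4": 0.5}


def weight_memory_gib(params_billion: float, dtype: str = "bf16") -> float:
    """Return the GiB needed to hold the weights alone at the given precision."""
    total_bytes = params_billion * 1e9 * BYTES_PER_PARAM[dtype]
    return total_bytes / 2**30


def fits(params_billion: float, dtype: str, vram_gib: float, headroom: float = 0.8) -> bool:
    """Return True if the weights fit within a fraction of the card's VRAM.

    headroom is an assumed rule of thumb that leaves room for everything
    that is not weights; it is not taken from any model card.
    """
    return weight_memory_gib(params_billion, dtype) <= vram_gib * headroom


if __name__ == "__main__":
    # Compare a 13B model at several precisions against a 24 GiB card.
    for dtype in ("bf16", "fp8", "int4"):
        size = weight_memory_gib(13, dtype)
        verdict = "fits" if fits(13, dtype, 24) else "does not fit"
        print(f"13B @ {dtype}: {size:.1f} GiB of weights, {verdict} on a 24 GiB card")
```

Under these assumptions a 13B model needs about 24 GiB for bf16 weights alone, which is why quantized or offloaded variants are the usual route on consumer cards. The model card's stated requirement should always take precedence over this estimate.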