Image generation

Text-to-image and image editing diffusion models. The open models (Stable Diffusion, FLUX, Z-Image) run locally on a 8–24 GB GPU through tools like ComfyUI; the hosted services lead on prompt-following and text rendering.

Providers

The leading hosted services — sign up and use them via app or API.

ProviderFromStrengthsAccess
Midjourney v7MidjourneyAesthetic quality, styleApp · API
Nano Banana ProGoogleEditing, text in images, controlApp · API
GPT ImageOpenAIPrompt-following, in-chat editingAPI · app
FLUX.1Black Forest LabsOpen + Pro; photoreal, strong textOpen · API
IdeogramIdeogramBest-in-class text renderingApp · API
RecraftRecraftBrand/vector design, controlApp · API
FireflyAdobeCommercially-safe, in PhotoshopApp · API

Open-source tools

Run these yourself on a local or rented GPU. Open weights are free to use, private, and finetunable.

The open diffusion model that started the local-image wave.

modelopen

Black Forest Labs' open-weight model — top open quality and text.

modelopen

Tongyi's efficient single-stream diffusion model.

modelopen

Node-graph UI for building diffusion pipelines; the power-user standard.

UI

The classic web UI for Stable Diffusion with a huge extension ecosystem.

UI

Midjourney-like simplicity on top of SDXL — sensible defaults, minimal fuss.

UIeasy

Tencent's open diffusion transformers for images and 3D assets.

modelopen

FLUX-2 image generation as pure C inference — a minimal study implementation.

learn

What you need to run it

See GPU prices to buy a card, hosting to rent one by the hour, and GPU programming to understand the libraries underneath. VRAM is the deciding factor — check each tool's model card for its memory needs.