Together AI

GPU clusters and a serverless inference platform from a research-focused cloud. Rents InfiniBand H100/H200 clusters for training alongside per-token inference for open models.

Open Together pricing ↗ ← All providers

On-demand USD prices are indicative snapshots from June 2026 for orientation only — they are not live. Spot/community and committed-use rates are often 30–70% lower. Always confirm on the provider's pricing page before committing.

On-demand GPUs

GPU	VRAM	$/hr	Type
H200 SXM	141 GB	$3.15	On-demand
H100 SXM	80 GB	$2.39	On-demand
A100 PCIe	80 GB	$1.30	On-demand
L40S	48 GB	$1.20	On-demand