Together AI

GPU clusters and a serverless inference platform from a research-focused cloud. Together AI rents out InfiniBand-connected H100/H200 clusters for training and offers per-token inference for open models.


On-demand USD prices are indicative snapshots from June 2026, given for orientation only; they are not live. Spot/community and committed-use rates are often 30–70% lower. Always confirm on the provider's pricing page before committing.

On-demand GPUs

GPU         VRAM     $/hr    Type
H200 SXM    141 GB   $3.15   On-demand
H100 SXM    80 GB    $2.39   On-demand
A100 PCIe   80 GB    $1.30   On-demand
L40S        48 GB    $1.20   On-demand
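
For rough budgeting, the arithmetic behind these figures can be sketched in a few lines of Python. This is only an illustration: the rates are the indicative snapshot values from the table above, the 30% and 70% discounts are the "often" range quoted for spot/community and committed-use pricing rather than actual quotes, and the function names (estimate_cost, cost_range) are hypothetical helpers, not part of any Together AI API.

```python
# Indicative on-demand rates (USD per GPU-hour) copied from the table above.
# These are snapshot values, not live prices.
ON_DEMAND_USD_PER_HOUR = {
    "H200 SXM": 3.15,
    "H100 SXM": 2.39,
    "A100 PCIe": 1.30,
    "L40S": 1.20,
}


def estimate_cost(gpu: str, gpu_count: int, hours: float, discount: float = 0.0) -> float:
    """Estimate the USD cost of running gpu_count GPUs for the given hours.

    discount is a fraction in [0, 1): 0.3 means 30% below the on-demand rate.
    """
    if not 0.0 <= discount < 1.0:
        raise ValueError("discount must be in the range [0, 1)")
    rate = ON_DEMAND_USD_PER_HOUR[gpu]
    return rate * gpu_count * hours * (1.0 - discount)


def cost_range(gpu: str, gpu_count: int, hours: float) -> tuple[float, float, float]:
    """Return (on_demand, discounted_low, discounted_high) cost estimates.

    Assumption: the 30-70% reduction quoted for spot/community and
    committed-use rates is applied as a flat discount to the on-demand rate.
    """
    on_demand = estimate_cost(gpu, gpu_count, hours)
    discounted_low = estimate_cost(gpu, gpu_count, hours, discount=0.70)
    discounted_high = estimate_cost(gpu, gpu_count, hours, discount=0.30)
    return on_demand, discounted_low, discounted_high


if __name__ == "__main__":
    # Example: one 8-GPU H100 SXM node for 24 hours.
    on_demand, low, high = cost_range("H100 SXM", gpu_count=8, hours=24)
    print(f"On-demand: ${on_demand:,.2f}")
    print(f"With a 30-70% discount: ${low:,.2f} to ${high:,.2f}")
```

With the snapshot rates, an 8-GPU H100 SXM node for 24 hours works out to 2.39 × 8 × 24 = $458.88 on demand. The estimate covers GPU-hours only; storage, networking, and any minimum-commitment terms are not modeled.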