Together AI
GPU clusters and a serverless inference platform from a research-focused cloud. Rents InfiniBand H100/H200 clusters for training alongside per-token inference for open models.
On-demand USD prices are indicative snapshots from June 2026 for orientation only — they are not live. Spot/community and committed-use rates are often 30–70% lower. Always confirm on the provider's pricing page before committing.
On-demand GPUs
| GPU | VRAM | $/hr | Type |
|---|---|---|---|
| H200 SXM | 141 GB | $3.15 | On-demand |
| H100 SXM | 80 GB | $2.39 | On-demand |
| A100 PCIe | 80 GB | $1.30 | On-demand |
| L40S | 48 GB | $1.20 | On-demand |