Model registry
Pick a model. Send a request.
Frontier open weights served on AWS Trainium and NVIDIA GPUs. Same OpenAI-compatible API for all of them, and your fine-tunes price the same as the base model.
| Model | Context | TPS | $/1M in | $/1M out | Hardware | |
|---|---|---|---|---|---|---|
GPT-OSS 120B OpenAI · 120B (MoE, ~5B active) · Apache 2.0 | 131.072k | 70 | $0.04 | $0.18 | AWS Trainium | View |
DeepSeek V4 Pro DeepSeek · Frontier MoE · DeepSeek License | 1,000k | 70 | $0.43 | $0.87 | NVIDIA H200 | View |
DeepSeek V4 Flash DeepSeek · Mid-tier MoE · DeepSeek License | 1,000k | 70 | $0.14 | $0.28 | AWS Trainium | View |
Kimi K2.6 Moonshot · 1T MoE (~32B active) · Modified MIT | 262.144k | 70 | $0.74 | $3.49 | NVIDIA H100 | View |
DeepSeek V3.2 DeepSeek · 671B MoE (~37B active) · DeepSeek License | 131.072k | 70 | $0.25 | $0.38 | AWS Trainium | View |