Speed First
The fastest agentic models
MiMo V2.5 Pro UltraSpeed peaks at 1,000 tokens/second. GLM 5.2 Fast runs at roughly twice standard throughput. Built for agents that cannot wait around.
Run DeepSeek, GLM and MiMo UltraSpeed through a single OpenAI-compatible endpoint, at cost + 10% with transparent pay-as-you-go pricing.
Works with Hermes Agent, OpenClaw, OpenCode and other OpenAI-compatible tools.
import OpenAI from "openai"

// Before: your existing OpenAI client
// const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY })

// After: point at gateway.fast, keep everything else
const client = new OpenAI({
  apiKey: process.env.GATEWAY_API_KEY,
  baseURL: "https://api.gateway.fast/v1"
})
Direct Access
Call any model directly by name: DeepSeek V4 Flash and Pro, GLM 5.2 standard and Fast, MiMo V2.5 Pro UltraSpeed. Buy dollar packs, no subscriptions, full control.
Cost + 10%
You pay exactly what the upstream provider charges plus a flat 10% — on every token class, including cache hits. No hidden margins, no rounding games.
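As a rough illustration of the flat markup, the sketch below applies 10% to each token class of an upstream rate card. The upstream figures are hypothetical placeholders chosen for easy arithmetic, not real provider prices, and the function name `applyMarkup` is invented for this example.

```javascript
// Flat markup stated on this page: upstream price plus 10%.
const MARKUP = 0.10;

/**
 * Apply the flat markup to every token class in an upstream rate card.
 * All rates are in USD per 1M tokens.
 */
function applyMarkup(upstreamRates) {
  const billed = {};
  for (const [tokenClass, usdPerMillion] of Object.entries(upstreamRates)) {
    // Round to 6 decimals so floating-point noise does not leak into the result.
    billed[tokenClass] = Number((usdPerMillion * (1 + MARKUP)).toFixed(6));
  }
  return billed;
}

// Hypothetical upstream rates, for illustration only.
const upstreamRates = { input: 1.0, cachedInput: 0.1, output: 3.0 };
const billedRates = applyMarkup(upstreamRates);
console.log(billedRates);
```

The same multiplier is applied to cached input as to regular input and output, which is what "on every token class, including cache hits" implies.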
Drop-in
Point your existing OpenAI client at https://api.gateway.fast/v1 and keep everything else. Streaming, tool calls, and structured output all work out of the box.
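A minimal sketch of streaming through the gateway might look like the following. It assumes the client is an instance of the official OpenAI Node SDK configured with the base URL above, and it relies only on the SDK's standard streaming shape (`chat.completions.create` with `stream: true`, then `choices[0].delta.content` on each chunk). The helper name `streamText` is invented here, and the model ID you pass is an assumption: this page lists display names, so take the exact identifier from the catalog.

```javascript
/**
 * Stream a chat completion and return the concatenated text.
 *
 * `client` is any object shaped like the OpenAI Node SDK client, so the
 * function can be exercised with a stub as well as with the real SDK.
 * `onToken` is called with each text fragment as it arrives.
 */
async function streamText(client, model, prompt, onToken = () => {}) {
  const stream = await client.chat.completions.create({
    model, // assumption: use the exact model ID from the gateway catalog
    messages: [{ role: "user", content: prompt }],
    stream: true,
  });

  let text = "";
  for await (const chunk of stream) {
    // Some chunks carry no text (for example role-only or final chunks).
    const piece = chunk.choices[0]?.delta?.content ?? "";
    if (piece) {
      onToken(piece);
      text += piece;
    }
  }
  return text;
}
```

With a real client, a call such as `streamText(client, modelId, "Hello", (t) => process.stdout.write(t))` would print tokens as they arrive, where `modelId` is whatever identifier the catalog gives for the model you want.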
Direct access to the current gateway.fast catalog, with separate cache-hit pricing where upstream providers report it.
| Model | Provider | Tier | Context (tokens) | Input ($ / 1M tokens) | Cached input ($ / 1M tokens) | Output ($ / 1M tokens) |
|---|---|---|---|---|---|---|
| DeepSeek V4 Flash (Cheapest) | DeepSeek | T1 | 1 million | $0.154 | $0.0031 | $0.308 |
| DeepSeek V4 Pro | DeepSeek | T2 | 1 million | $0.4785 | $0.004 | $0.957 |
| GLM 5.2 | Fireworks | T2 | 1 million | $1.54 | $0.154 | $4.84 |
| GLM 5.2 Fast (Fast router) | Fireworks | T3 | 1 million | $2.31 | $0.231 | $7.26 |
| MiMo V2.5 Pro UltraSpeed | Xiaomi MiMo | T3 | 1 million | $1.5231 | $0.0127 | $3.0462 |
Buy prepaid credit and spend it directly on tokens. No seats, no monthly commitment.
Starter: for testing and small projects
Builder: for active development
Scale: for production workloads
Credits are metered in micro-dollars at the per-model rates above.
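To make the metering concrete: a rate of $X per 1M tokens is numerically the same as X micro-dollars per token, so a request's cost in micro-dollars is each token count multiplied by the matching rate. The sketch below uses two rows from the table above. The usage figures are made up, the function name is invented, and rounding to the nearest micro-dollar is an assumption, since the page does not state how fractional micro-dollars are handled. It also assumes cached and uncached input tokens are counted separately.

```javascript
// Rates copied from the pricing table, in USD per 1M tokens.
// Numerically, USD per 1M tokens equals micro-dollars per token.
// Keys are display names from the table, not API model IDs.
const RATES = {
  "DeepSeek V4 Flash": { input: 0.154, cachedInput: 0.0031, output: 0.308 },
  "GLM 5.2 Fast": { input: 2.31, cachedInput: 0.231, output: 7.26 },
};

/**
 * Estimate the cost of one request in micro-dollars.
 * `usage` counts uncached input, cached input, and output tokens separately.
 */
function requestCostMicroUsd(rate, usage) {
  const raw =
    usage.inputTokens * rate.input +
    usage.cachedInputTokens * rate.cachedInput +
    usage.outputTokens * rate.output;
  // Assumption: round to the nearest whole micro-dollar.
  return Math.round(raw);
}

// Made-up usage for illustration.
const usage = { inputTokens: 10000, cachedInputTokens: 50000, outputTokens: 2000 };
const flashCost = requestCostMicroUsd(RATES["DeepSeek V4 Flash"], usage);
console.log(`${flashCost} micro-dollars = $${(flashCost / 1e6).toFixed(6)}`);
```

For the usage shown, the DeepSeek V4 Flash request comes to 2,311 micro-dollars (1,540 for input, 155 for cached input, 616 for output), or about $0.0023 of prepaid credit.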
Checkout creates your account, sends your API key, and opens the dashboard for usage and balance history.