The fastest agentic AI models. One API.

Run DeepSeek, GLM and MiMo UltraSpeed through a single OpenAI-compatible endpoint, at cost + 10% with transparent pay-as-you-go pricing.

Works with Hermes Agent, OpenClaw, OpenCode and other OpenAI-compatible tools.

client.ts
Drop in
// Before: your existing OpenAI client
const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY })

// After: point at gateway.fast, keep everything else
const client = new OpenAI({
  apiKey: process.env.GATEWAY_API_KEY,
  baseURL: "https://api.gateway.fast/v1"
})
1000 TOK/S

Speed First

The fastest agentic models

MiMo V2.5 Pro UltraSpeed peaks at 1,000 tokens/second. GLM 5.2 Fast runs at roughly twice standard throughput. Built for agents that cannot wait around.

Browse models

Direct Access

Choose your model

Call any model directly by name: DeepSeek V4 Flash and Pro, GLM 5.2 standard and Fast, MiMo V2.5 Pro UltraSpeed. Buy dollar packs, no subscriptions, full control.

Browse models

Cost + 10%

Transparent pass-through pricing

You pay exactly what the upstream provider charges plus a flat 10% — on every token class, including cache hits. No hidden margins, no rounding games.

See pricing

Drop-in

OpenAI-compatible endpoint

Point your existing OpenAI client at https://api.gateway.fast/v1 and keep everything else. Streaming, tool calls, and structured output all work out of the box.

Read the docs

Live models. Honest pricing.

Direct access to the current gateway.fast catalog, with separate cache-hit pricing where upstream providers report it.

DeepSeek V4 FlashCheapest
T1DeepSeek1 million
$ In / 1M
$0.154
$ Out / 1M
$0.308
Cached input / 1M
$0.0031
DeepSeek V4 Pro
T2DeepSeek1 million
$ In / 1M
$0.4785
$ Out / 1M
$0.957
Cached input / 1M
$0.004
GLM 5.2
T2Fireworks1 million
$ In / 1M
$1.54
$ Out / 1M
$4.84
Cached input / 1M
$0.154
GLM 5.2 FastFast router
T3Fireworks1 million
$ In / 1M
$2.31
$ Out / 1M
$7.26
Cached input / 1M
$0.231
MiMo V2.5 Pro UltraSpeedUltraSpeed
T3Xiaomi MiMo1 million
$ In / 1M
$1.5231
$ Out / 1M
$3.0462
Cached input / 1M
$0.0127

Simple pricing. No subscriptions.

Buy prepaid credit and spend it directly on tokens. No seats, no monthly commitment.

Starter

$10

For testing and small projects

Popular

Builder

$20

For active development

Scale

$50

For production workloads

Credits are metered in micro-dollars at the per-model rates above.

Add credit.
Start calling models.

Checkout creates your account, sends your API key, and opens the dashboard for usage and balance history.