OpenAI-compatible · API live

All your favorite open-source LLMs.

Access 14 leading open-source models, including Qwen, GLM, DeepSeek, MiniMax, Kimi, Mistral, and Llama, through a single base URL and a single key.

Kadegate
Base URL: https://kadegate.com/api/v1
14 models available
1 endpoint, OpenAI-compatible
SSE streaming, identical format
$5 starter plan, weekly

Who it's for

Built for every workflow.

One key. Every frontier model. Whoever you are, you stop babysitting provider accounts and start shipping.

01 Developers
One .env key. Swap models by editing a single string. No new SDKs, no new dashboards.

02 AI agent builders
Route planning to a reasoning model and classification to a small, fast one: same endpoint, same auth, zero glue code.

03 Startups
Prototype on free models, deploy on the best ones. Switch providers when the next frontier model drops, without touching your codebase.

04 Researchers
Run the same prompt across GPT-4o, Claude, Gemini, and DeepSeek in one loop. Compare quality, latency, and cost from a single script.
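
The routing and comparison workflows described above might look roughly like the following sketch. It uses only the Python standard library and the base URL given on this page. The `ROUTES` table, the task names, and the helper functions are hypothetical, and the choice of which model counts as the "reasoning" one is an assumption made for illustration. The response shape (`choices[0].message.content`) follows the OpenAI-compatible format the page describes.

```python
import json
import time
import urllib.request

BASE_URL = "https://kadegate.com/api/v1"  # base URL from this page

# Hypothetical routing table: task names and model choices are illustrative.
ROUTES = {
    "plan": "qwen/qwen3.5-plus-20260420",
    "classify": "z-ai/glm-4.7-flash",
}


def build_request(model, prompt, api_key):
    """Build a chat-completions request; only the model string varies."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def ask(model, prompt, api_key):
    """Send one request and return (reply_text, seconds_elapsed)."""
    start = time.perf_counter()
    with urllib.request.urlopen(build_request(model, prompt, api_key)) as resp:
        data = json.load(resp)
    elapsed = time.perf_counter() - start
    return data["choices"][0]["message"]["content"], elapsed


def compare(models, prompt, api_key):
    """Run the same prompt against each model in turn."""
    return {model: ask(model, prompt, api_key) for model in models}
```

With a valid key, `ask(ROUTES["classify"], "Is this spam? ...", key)` would send a classification prompt to the small model, and `compare(ROUTES.values(), prompt, key)` would return each model's reply alongside its wall-clock latency. Neither function is called at import time, so the sketch makes no network requests until you invoke it.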

Three lines. That's the whole change.

If your code already uses the OpenAI SDK, point it here. Nothing else moves.

Python

from openai import OpenAI

# Point the official OpenAI SDK at the Kadegate base URL.
client = OpenAI(
    base_url="https://kadegate.com/api/v1",
    api_key="YOUR_KEY",
)

resp = client.chat.completions.create(
    model="qwen/qwen3.6-flash",
    messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)

JavaScript (Node.js)

import OpenAI from "openai";

// Same SDK, same calls; only baseURL and apiKey change.
const client = new OpenAI({
  baseURL: "https://kadegate.com/api/v1",
  apiKey: process.env.KADEGATE_KEY,
});

const r = await client.chat.completions.create({
  model: "qwen/qwen3.6-flash",
  messages: [{ role: "user", content: "Hello" }],
});
console.log(r.choices[0].message.content);

curl

curl https://kadegate.com/api/v1/chat/completions \
  -H "Authorization: Bearer $KADEGATE_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"qwen/qwen3.6-flash","messages":[{"role":"user","content":"Hello"}]}'

Built for production.

The boring infrastructure work, already finished.

OpenAI-compatible

Use the official SDK or curl. Change the base URL and the API key — that's it.

Real streaming

Server-sent events with the same data: frames and [DONE] sentinel.

Pay only for what you use

Credit-based, per-token pricing. Top up your balance and spend it across any model. No subscriptions.

Key management

Per-project keys. Rotate, revoke, audit — every request is logged.

Free credits to start

Every new account gets free credits — no card required. Start building in seconds.

Honest analytics

Per-key usage, error rates, token counts and full request logs in your dashboard.
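
The streaming format mentioned under "Real streaming" (data: frames ending in a [DONE] sentinel) can be consumed without any SDK. The sketch below is a minimal parser, assuming each data: frame carries a JSON chunk shaped like an OpenAI chat-completion chunk, with text under choices[].delta.content. The function names are hypothetical, and the sample frames are hand-written for illustration, not captured from the service.

```python
import json


def iter_sse_chunks(lines):
    """Yield decoded JSON payloads from SSE lines until the [DONE] sentinel."""
    for raw in lines:
        line = raw.decode("utf-8") if isinstance(raw, bytes) else raw
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other fields
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            return  # sentinel: the stream is finished
        yield json.loads(payload)


def collect_text(chunks):
    """Concatenate the delta.content pieces of streamed chat chunks."""
    parts = []
    for chunk in chunks:
        for choice in chunk.get("choices", []):
            piece = choice.get("delta", {}).get("content")
            if piece:
                parts.append(piece)
    return "".join(parts)


# Hand-written sample frames (illustrative only).
sample = [
    b'data: {"choices":[{"delta":{"content":"Hel"}}]}\n',
    b"\n",
    b'data: {"choices":[{"delta":{"content":"lo"}}]}\n',
    b"data: [DONE]\n",
]
print(collect_text(iter_sse_chunks(sample)))  # prints "Hello"
```

In practice the `lines` argument could be any iterable of raw lines, such as an HTTP response body opened with `"stream": true` in the request; the parser itself does no network I/O.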

Why not just use providers directly?

Managing five provider accounts is a part-time job. Here's the honest comparison.

What you need       | 5 separate providers       | Kadegate
API keys to manage  | 5 keys, 5 dashboards       | 1 key
Switch models       | Rewrite SDK config         | Change one string
Billing             | Separate bill per provider | One credit balance
Streaming format    | Different per provider     | Identical SSE always
Usage logs          | Scattered across portals   | One dashboard, per-key
New model support   | Update each integration    | Instant, zero changes
Rate limit handling | Build your own fallback    | Automatic failover

Your keys, your data
We never store request payloads. Keys are hashed at rest.
99.9% uptime SLA
Live status at /status. Automatic provider failover on errors.
Full request logs
Every request logged: model, tokens, latency, status code.
OpenAI SDK native
No custom client. The official SDK works unchanged, out of the box.

Pay only for what you use

No subscriptions. Add credits, spend them across any model, priced per token.

Free to start
$5 free credit

Every new account gets $5 in credits — no card required. Then top up anytime.

Model                       | Input / 1M | Output / 1M | Cache / 1M
google/gemma-4-26b-a4b-it   | $0.06      | $0.33       | $0.00
google/gemma-4-31b-it       | $0.12      | $0.36       | $0.00
meta-llama/llama-4-maverick | $0.15      | $0.60       | $0.00
minimax/minimax-m2.7        | $0.27      | $1.08       | $0.00
minimax/minimax-m3          | $0.30      | $1.30       | $0.00
qwen/qwen3.5-plus-20260420  | $0.30      | $1.80       | $0.00
xiaomi/mimo-v2-flash        | $0.10      | $0.30       | $0.00
z-ai/glm-4.7-flash          | $0.06      | $0.40       | $0.00
Anthropic: Claude Opus 4.6  | $5.00      | $25.00      | $0.00
claude-opus-4-8             | $5.00      | $25.00      | $0.50
claude-opus-4-7             | $5.00      | $25.00      | $0.00
gpt-5.4-nano                | $0.20      | $1.25       | $0.00

Prices in USD per 1,000,000 tokens. See all models →
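
To make the per-token pricing concrete, here is a small worked calculation using three rows copied from the price list above. It is a sketch only: the `request_cost` helper is hypothetical, and it ignores the Cache column because this page does not say how cached tokens are counted.

```python
from decimal import Decimal

# USD per 1,000,000 tokens as (input, output), copied from the price list above.
PRICES = {
    "z-ai/glm-4.7-flash": (Decimal("0.06"), Decimal("0.40")),
    "minimax/minimax-m2.7": (Decimal("0.27"), Decimal("1.08")),
    "claude-opus-4-7": (Decimal("5"), Decimal("25")),
}
TOKENS_PER_UNIT = Decimal(1_000_000)


def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request (cache pricing not modeled)."""
    price_in, price_out = PRICES[model]
    return (price_in * input_tokens + price_out * output_tokens) / TOKENS_PER_UNIT


# 10,000 input tokens and 2,000 output tokens on minimax/minimax-m2.7:
# 10,000 * 0.27 / 1e6 + 2,000 * 1.08 / 1e6 = 0.0027 + 0.00216
print(request_cost("minimax/minimax-m2.7", 10_000, 2_000))  # prints 0.00486
```

At these rates, the $5 in starting credits would cover roughly a thousand requests of that size on minimax/minimax-m2.7 ($5 / $0.00486 ≈ 1,028), or 20 such requests on claude-opus-4-7 at $0.10 each.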