Pricing
Every request is metered to the token. We charge the provider's per-million rate plus 25% — and we show you both numbers, every time. No subscriptions. No seat fees. No minimum.
- Buy credits to fund your account ($1 = 1 credit)
- Each call is charged: tokens × provider rate × 1.25
- Real provider cost + our margin, shown on every response
Same per-token rate regardless of pack size. The packs are just top-up amounts — your real cost is per-call. Credits never expire.
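The metering rule above can be sketched in a few lines. This is an illustrative sketch, not the platform's actual API: the helper name and the rates are hypothetical.

```python
# Sketch of the metering rule: tokens x raw provider rate x 1.25.
# Rates and the helper name are illustrative, not the platform's API.
MARGIN = 1.25  # provider's per-million rate plus 25%

def call_cost(input_tokens, output_tokens, in_rate_per_m, out_rate_per_m):
    """Dollar (= credit) cost of one API call, given raw provider rates."""
    cost_in = input_tokens / 1_000_000 * in_rate_per_m * MARGIN
    cost_out = output_tokens / 1_000_000 * out_rate_per_m * MARGIN
    return cost_in + cost_out

# 80K input tokens at a raw $5.00/M provider rate -> $0.50 charged
print(call_cost(80_000, 0, 5.00, 25.00))  # 0.5
```

Your credit balance is simply decremented by this amount on each response.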
Per model
All prices below include our 25% margin. The "$25 buys" column shows input tokens you can send for $25 at that model's rate.
| Model | Context | Input $/M | $25 buys |
|---|---|---|---|
| Claude Opus 4.7 (flagship) | 1M | $6.25 | 4.0M |
| Claude Sonnet 4.6 (flagship) | 200K | $3.75 | 6.7M |
| Claude Haiku 4.5 | 200K | $1.25 | 20.0M |
| GPT-5.5 | 1M | $6.25 | 4.0M |
| GPT-5.4 | 1M | $3.13 | 8.0M |
| GPT-5.4 Mini | 400K | $0.94 | 26.6M |
| Gemini 3.1 Pro Preview | 1M | $2.50 | 10.0M |
| Gemini 3 Flash Preview (value) | 1M | $0.625 | 40.0M |
| Grok 4.20 Reasoning | 2M | $2.50 | 10.0M |
| Grok 4.1 Fast | 2M | $0.25 | 100.0M |
| DeepSeek V4 Pro (value) | 1M | $2.18 | 11.5M |
| DeepSeek V4 Flash (value) | 1M | $0.175 | 142.9M |
Cost = (input tokens ÷ 1M × input rate) + (output tokens ÷ 1M × output rate). Output tokens are typically 2–6× more expensive than input tokens on most providers.
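As a worked example of that formula, using the listed (margin-inclusive) Sonnet 4.6 input rate from the table; the output rate here is a hypothetical 5× the input rate:

```python
# Worked example of the cost formula. Input rate is Sonnet 4.6's listed
# (margin-inclusive) rate; the output rate is an assumed 5x input.
in_rate = 3.75      # $/M input tokens
out_rate = 18.75    # $/M output tokens (hypothetical)

input_tokens, output_tokens = 12_000, 3_000
cost = input_tokens / 1e6 * in_rate + output_tokens / 1e6 * out_rate
print(round(cost, 5))  # 0.10125 -> about 10 cents for the call
```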
Plus image, video, voice, and TTS models priced per asset. Full table in the docs.
Sign up, get free starter credits, and see the real per-call cost on your first response.
FAQ
**How exactly am I billed?** Per request, per token. Every API call is metered: input tokens × the per-million input rate, output tokens × the per-million output rate. We charge the provider rate plus 25%. The exact cost appears in the response headers and on every entry in your dashboard. No rounding up, no per-task fudge factors.
**What are credits?** Credits are just a top-up balance — $1 = 1 credit. Your balance ticks down as actual API costs accrue. Nothing more clever than that. We don't pretend a 'credit' equals a fixed amount of compute, because it doesn't — different models cost wildly different amounts per token.
**Do credits expire?** No. Your money is your money. Use it whenever.
**Can I choose which model handles each call?** Yes. Pick your model per call, per mission, per task. Mix providers — Opus 4.7 for planning, Sonnet 4.6 for execution, V4 Flash for cheap throwaway calls. Your call. We just route the request and bill you the real cost plus margin.
**What happens if my balance hits zero mid-mission?** The mission pauses. Top up and resume exactly where you left off. No work lost, no restarts.
**Why the 25% margin?** Because the orchestration engine, multi-provider routing, sessions, billing, observability, and the live workspace aren't free to build or run. Every API response shows the raw provider cost and our margin separately — no obfuscation. If you're a heavy user (>$50/mo), you can apply for the Developer tier (12.5%) or Lifetime (0%). See the developers page.
**Do I have to pay to try it?** No. Sign up, get free starter credits. Top up only when you want to keep going.