NEW · Up to 14× multiplier, locked in at purchase
Neokens

Neokens: AI tokens for up to 10× less. One key. Every model.

Neokens is a pay-as-you-go API gateway for Claude, GPT-5, Gemini 3 Pro and Llama. Buy bulk credits once, spend them across every major model — same OpenAI-compatible API, same speed, a fraction of the cost.

Credits never expire · No subscription · Pay only what you use
neokens-console / credits & usage
live
WALLET · PREVIEW
CREDIT BALANCE
$100.00
from $10 top-up
MULTIPLIER
10×
locked in
MODELS
9+
OpenAI/Anthropic/Google
EXPIRES
Never
credits never expire
COMPLETION · gpt-5-mini
POST /v1/chat/completions
↳ model=gpt-5-mini, stream=true
1,284 in · 642 out tokens
↳ list=$0.0192 · you=$0.0019
200 OK · 1.42s · saved 90%
MODELS · ROUTED · all healthy
OAI · OpenAI · GPT-5 · $0.10/M
ANT · Anthropic · Claude · $0.30/M
GGL · Google · Gemini · $0.18/M
How it works · 04 steps

Buy credits once.
Spend on every AI model.

01

Top up once

Pick a credit pack from $10 to $1,000+. Pay in seconds with card, ACH, or crypto.

02

Get the multiplier

Every dollar buys you up to 10× the equivalent provider credit. Locked in at top-up.

03

Drop in our key

Swap one base URL. Your existing OpenAI / Anthropic SDK code keeps working once it points at Neokens.

04

Spend across 16+ models

GPT-5.x Codex/Flagship, Claude 4.5/4.6, Gemini 2.5/3.x — one key, one balance, no per-vendor accounts.

Model catalog

One balance.
Every model worth using.

Prices below are USD per 1M input tokens. Output and image rates are listed in the docs. All models share the same API.

ANTHROPIC · Flagship

Claude Opus 4.7

The absolute peak of AI reasoning. For the most complex challenges.

$15.00/M
$1.50/M
save 90%
ANTHROPIC

Claude Sonnet 4.6

Exceptional performance for large-scale enterprise development.

$3.00/M
$0.30/M
save 90%
ANTHROPIC · Popular

Claude Sonnet 4.5

The balanced choice for rapid feature development and debugging.

$3.00/M
$0.30/M
save 90%
ANTHROPIC

Claude Haiku 4.5

Ultra-fast, efficient responses for high-volume automated tasks.

$0.80/M
$0.08/M
save 90%
GOOGLE · Flagship

Gemini 3.1 Pro

State-of-the-art multimodal reasoning with 1M+ context window.

$1.25/M
$0.18/M
save 86%
GOOGLE · Popular

Gemini 2.5 Pro

Exceptional coding and complex problem solving performance.

$1.25/M
$0.18/M
save 86%
GOOGLE

Gemini 3.1 Pro Low

Efficient Pro-tier for cost-sensitive and high-volume workloads.

$0.50/M
$0.07/M
save 86%
GOOGLE

Gemini 3 Flash

High-speed inference with reliable quality for daily tasks.

$0.075/M
$0.01/M
save 87%
OPENAI · Flagship

GPT-5.5

Pinnacle reasoning, multimodal mastery, unmatched on every benchmark.

$10.00/M
$1.00/M
save 90%
OPENAI

GPT-5.4

Next-level reasoning and superior performance across all tasks.

$5.00/M
$0.50/M
save 90%
OPENAI

GPT-5.1

Next-gen flagship with unparalleled reasoning and multimodal capabilities.

$2.50/M
$0.25/M
save 90%
OPENAI · Popular

GPT-5.1 Codex

Code-specialized variant. Purpose-built for software engineering.

$2.50/M
$0.25/M
save 90%
OPENAI

GPT-5.2 Codex

The most capable code model. Ideal for complex refactors and architecture.

$3.00/M
$0.30/M
save 90%
OPENAI

GPT-5.3 Codex

Ultra-fast code generation for real-time development workflows.

$1.50/M
$0.15/M
save 90%
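
The "save" badge on each card appears to follow from the two prices shown: the provider list price first, then the Neokens price, both in USD per 1M input tokens. A minimal sketch of that arithmetic, assuming the badge is the rounded percentage gap between the two (the name `savingsPercent` is illustrative and not part of any Neokens API):

```javascript
// Illustrative helper, not part of any Neokens API.
// Assumes the "save" badge is the rounded percentage gap between the
// provider list price and the Neokens price (USD per 1M input tokens).
function savingsPercent(listPerM, neokensPerM) {
  if (listPerM <= 0) throw new RangeError("list price must be positive");
  return Math.round((1 - neokensPerM / listPerM) * 100);
}

// Prices taken from the catalog above.
console.log(savingsPercent(15.0, 1.5));   // Claude Opus 4.7 -> 90
console.log(savingsPercent(1.25, 0.18));  // Gemini 3.1 Pro  -> 86
console.log(savingsPercent(0.075, 0.01)); // Gemini 3 Flash  -> 87
```

Under this reading, the Google models show 86 to 87% rather than 90% because their listed Neokens prices are slightly more than one tenth of the list price.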
The math · Multiplier

$1 in.
$10 out.

We aggregate enterprise commitments across all major providers and pass the volume discount back to you. Drag the slider — see exactly what your top-up unlocks.

No subscription
Pay in chunks. Burn down at your pace. Refill when you want.
Credits never expire
Buy a year of inference and run it whenever — no clock ticking.
Provider-agnostic
Switch from GPT to Claude to Llama mid-project without billing chaos.
YOU PAY
$100.00
$10 to $1,000
YOU GET · 10× CREDIT
$1,000.00
7,692,308 input tokens on GPT-5 mini
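
The slider's arithmetic can be sketched as follows. This assumes that credit is simply cash multiplied by the locked-in multiplier, and that credit is then drawn down at the provider list price. Both are readings of the copy above rather than documented billing rules, and `creditFor` and `inputTokensFor` are illustrative names.

```javascript
// Illustrative sketch of the top-up math; not an official Neokens formula.
// Assumption: credit = cash paid x multiplier locked in at top-up.
function creditFor(cashUsd, multiplier = 10) {
  if (cashUsd < 0 || multiplier <= 0) throw new RangeError("invalid top-up");
  return cashUsd * multiplier;
}

// Assumption: credit is spent at the provider list price (USD per 1M input tokens).
function inputTokensFor(creditUsd, listPricePerM) {
  if (listPricePerM <= 0) throw new RangeError("price must be positive");
  return Math.floor((creditUsd / listPricePerM) * 1e6);
}

const credit = creditFor(100);             // $100 top-up at 10x
console.log(credit);                       // 1000
console.log(inputTokensFor(credit, 2.5));  // GPT-5.1 at $2.50/M list -> 400000000
```

The multiplier is described as "up to" 10×, so the default here is the best case; the actual value would be whatever is locked in at top-up.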
Drop-in · OpenAI compatible

One base URL.
Zero rewrites.

If your code already calls OpenAI, swap the base URL and you're done. Anthropic, Google, Llama — same shape, same SDK, one bill.

import OpenAI from "openai";

const client = new OpenAI({
  apiKey: process.env.NEOKENS_KEY,
  baseURL: "https://api.quatarly.cloud/v1",
});

const r = await client.chat.completions.create({
  model: "claude-sonnet-4-6-thinking", // any of 9+ models
  messages: [{ role: "user", content: "Write a haiku about credits." }],
});

console.log(r.choices[0].message.content);
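
For callers that do not use the SDK, the same request can be sketched with plain `fetch`. This assumes the gateway accepts the standard OpenAI chat-completions request body at `POST /v1/chat/completions` (the path shown in the console preview) and Bearer-token authentication, as the OpenAI API does; neither is confirmed by Neokens documentation here. `buildChatRequest` is a hypothetical helper, and the model ids are the two that appear on this page.

```javascript
// Hypothetical helper: builds an OpenAI-style chat completion request.
// Assumptions: Bearer auth and the standard OpenAI request body are accepted.
function buildChatRequest(baseURL, apiKey, model, prompt) {
  return {
    // Trim trailing slashes so the path is joined exactly once.
    url: `${baseURL.replace(/\/+$/, "")}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({
        model,
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}

// Switching providers changes only the model id; the request shape stays the same.
const base = "https://api.quatarly.cloud/v1";
const key = process.env.NEOKENS_KEY ?? "placeholder-key"; // placeholder for illustration
const requests = ["gpt-5-mini", "claude-sonnet-4-6-thinking"].map((model) =>
  buildChatRequest(base, key, model, "Write a haiku about credits.")
);
for (const { url, init } of requests) {
  console.log(init.method, url, JSON.parse(init.body).model);
}

// To send one: const res = await fetch(requests[0].url, requests[0].init);
```

Keeping request construction separate from the network call makes it easy to inspect or log exactly what would be sent before spending any credit.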
10×
Credit multiplier
Up to 10×, locked in at top-up.
9+
Models supported
OpenAI, Anthropic, Google, more
$0
Subscription cost
Pay only what you spend
<200ms
Routing overhead
Same speed, lower price
Top-up packs

Pay once. Run thousands of completions.

Starter
$10 × 10
→ $100 credit
  • 10× multiplier
  • All 9+ models
  • Email support
  • Credits never expire
Pro · Popular
$25 × 10
→ $250 credit
  • Everything in Starter
  • Priority routing
  • Slack support
  • Usage analytics
  • Spend alerts
Scale
$100+ × 10
→ $1,000+ credit
  • Volume bonuses
  • Dedicated capacity
  • Invoice billing
  • Solutions engineer
  • 99.99% SLA
FAQ

Questions, answered.

We aggregate enterprise volume commitments across every major AI provider (OpenAI, Anthropic, Google, xAI, Meta) and pass the bulk discount back to you as a credit multiplier. $10 in cash becomes up to $100 of usage credit, locked in at the moment of top-up. The exact multiplier per model is shown on the catalog page.

Stop overpaying for tokens.
Start shipping AI features.