Skip to main content
Orbitrage routes across models from Azure (GPT and open models via Foundry), Anthropic, OpenAI, Groq, and xAI. Let the router pick (model: "auto") or pin any model by id.
The catalog below is representative. The live, authoritative list for your account is always GET https://api.orbitrage.ai/v1/models — see List models. Availability depends on which providers are configured and any BYOK keys you’ve saved.

Pricing & tiers

Prices are USD per 1M tokens (input / output). Pooled-credit calls add a small platform markup (currently 2.5%); BYOK calls cost nothing against your Orbitrage credits.

Frontier

ModelProviderInputOutputVisionContext
claude-fable-5Anthropic$10.00$50.00200K
claude-opus-4-8Anthropic$5.00$25.00200K
claude-opus-4-7Anthropic$5.00$25.00200K
claude-sonnet-4-6Anthropic$3.00$15.00200K
gpt-5.5Azure$5.00$20.00400K
claude-fable-5 is Anthropic’s top-tier model (above Opus) — the highest-quality option in the catalog, used for the hardest reasoning and agentic tasks.

High

ModelProviderInputOutputVisionContext
gpt-5.4Azure$1.25$5.00400K
gpt-5.3-chatAzure$1.50$6.00400K
Kimi-K2.6Azure Foundry$0.95$4.00200K
DeepSeek-V3.2Azure Foundry$0.40$1.10131K
grok-4xAI$1.25$2.50256K

Mid

ModelProviderInputOutputVisionContext
gpt-5.4-nanoAzure$0.20$1.25256K
gpt-4oAzure$2.50$10.00128K
FW-MiniMax-M2.5Azure Foundry$0.30$1.20262K
DeepSeek-V4-FlashAzure Foundry$0.14$0.28131K

Basic

ModelProviderInputOutputVisionContext
gpt-5-nanoAzure$0.10$0.40
gpt-4o-miniAzure$0.165$0.66
gpt-5.4-miniAzure$0.10$0.40256K
llama-3.1-8b-instantGroq$0.05$0.08131K

Image

ModelProviderBilling
gpt-image-2AzurePer image

Audio (managed, included)

Speech-to-text and text-to-speech are offered as a managed service on Deepgram — no BYOK needed. See Audio.
ModelProviderTypeBilling
nova-3DeepgramSpeech-to-textPer minute of audio
nova-3-multilingualDeepgramSpeech-to-textPer minute of audio
nova-2DeepgramSpeech-to-textPer minute of audio
aura-2-thalia-enDeepgramText-to-speechPer 1,000 characters
The full catalog also includes the DeepSeek R1 reasoning model, additional GPT-5 variants (gpt-5.3, gpt-chat-latest), llama-3.1-70b-versatile on Groq, and the full xAI Grok family (grok-4.3, grok-4-fast, grok-3 variants, and more). Query /v1/models for everything enabled on your account.

Cost, savings, and the markup

Every call records four cost figures, so you can see exactly where your money goes:
FieldMeaning
Cost (cost_usd)What you’re billed — upstream price plus the 2.5% markup. 0 for BYOK.
Provider costThe raw upstream price Orbitrage paid (internal).
Baseline costWhat the same call would have cost on a frontier baseline (claude-sonnet-4-6).
Saved (saved_usd)baseline − cost — the routing savings, never negative.
The baseline lets the dashboard answer “how much did routing save me?” by comparing every routed call against a single frontier model.

Vision & multimodal

Vision-capable models accept image content blocks in the standard OpenAI format, and the router prefers one automatically when your prompt contains images. The engine also exposes image generation/editing and audio (transcription, translation, speech) endpoints — see the API reference.

Pinning vs. aliases

  • Pin a model by passing its exact id — no scoring, straight to that model.
  • Route by passing auto (or router / default / orbitrage).
  • Model ids match flexibly — claude-sonnet-4.6 and claude-sonnet-4-6 resolve to the same model, and family names (e.g. claude-opus, grok-4) resolve to the appropriate provider and tier.