►
1,000× PRICE SPREAD DETECTED.
CHEAPEST OUTPUT: Mistral Nemo AT $0.030/1M ►
PRICIEST: GPT-5.5 AT $30.0/1M.
THREE ORDERS OF MAGNITUDE SEPARATE THE LOWEST- AND HIGHEST-COST SPECIES IN THIS DEX.
MULTIMODAL (IMAGE IN)
30 / 50
REASONING-CAPABLE
44 / 50
LARGEST CONTEXT
GPT-5.5 (1M)
TYPE CHART — VENDOR / TYPE MAPPING (DATA ENCODING)
◈PSYCHICANTHROPIC
◉WATERDEEPSEEK
⚡ELECTRICGOOGLE
◌GHOSTMINIMAX
◇FLYINGMISTRALAI
✦FAIRYMOONSHOTAI
◎POISONNEX AGI
▬GROUNDNVIDIA
⬡STEELOPENAI
○NORMALOPENROUTER
✱ICEPOOLSIDE
◆FIREQWEN
❀GRASSSTEPFUN
✶FIGHTINGTENCENT
▲DRAGONXIAOMI
◼DARKZ AI
#001DeepSeek V4 FlashHP 210
◉ WATERDEEPSEEK
CTX: 1M
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window.
► Reasoning60
► Tool Use50
#002MiMo-V2.5HP 210
▲ DRAGONXIAOMI
CTX: 1M
MiMo-V2.5 is a native omnimodal model by Xiaomi.
#003MiniMax M3HP 210
◌ GHOSTMINIMAX
CTX: 1M
MiniMax-M3 is a multimodal foundation model from MiniMax.
#004Hy3 previewHP 50
✶ FIGHTINGTENCENT
CTX: 262K
Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use.
► Reasoning60
► Tool Use50
#005Owl AlphaHP 210
○ NORMALOPENROUTER
CTX: 1M
Owl Alpha is a high-performance foundation model designed for agentic workloads.
► Tool Use50
► Structured Out30
#006Claude Opus 4.7HP 200
◈ PSYCHICANTHROPIC
CTX: 1M
Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents.
#007DeepSeek V4 ProHP 210
◉ WATERDEEPSEEK
CTX: 1M
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window.
► Reasoning60
► Tool Use50
#008Claude Opus 4.8HP 200
◈ PSYCHICANTHROPIC
CTX: 1M
Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family.
#009GLM 5.2HP 210
◼ DARKZ AI
CTX: 1M
GLM 5.2 is a large-scale reasoning model from Z.ai.
► Reasoning60
► Tool Use50
#010Claude Sonnet 4.6HP 200
◈ PSYCHICANTHROPIC
CTX: 1M
Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work.
#011Step 3.7 FlashHP 50
❀ GRASSSTEPFUN
CTX: 256K
Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model.
#012GPT-5.5HP 210
⬡ STEELOPENAI
CTX: 1M
GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks.
#013Gemini 3 Flash PreviewHP 210
⚡ ELECTRICGOOGLE
CTX: 1M
Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance.
#014DeepSeek V3.2HP 30
◉ WATERDEEPSEEK
CTX: 131K
DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance.
► Reasoning60
► Tool Use50
#015Gemini 2.5 Flash LiteHP 210
⚡ ELECTRICGOOGLE
CTX: 1M
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency.
#016Nemotron 3 UltraHP 200
▬ GROUNDNVIDIA
CTX: 1M
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE).
► Reasoning60
► Tool Use50
#017Gemini 2.5 FlashHP 210
⚡ ELECTRICGOOGLE
CTX: 1M
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks.
#018gpt-oss-120bHP 30
⬡ STEELOPENAI
CTX: 131K
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases.
► Reasoning60
► Tool Use50
#019Kimi K2.6HP 50
✦ FAIRYMOONSHOTAI
CTX: 262K
Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration.
#020Laguna M.1HP 50
✱ ICEPOOLSIDE
CTX: 262K
Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai/), optimized for complex software engineering tasks.
► Reasoning60
► Tool Use50
#021MiMo-V2.5-ProHP 210
▲ DRAGONXIAOMI
CTX: 1M
MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and l…
► Reasoning60
► Tool Use50
#022GLM 5.1HP 40
◼ DARKZ AI
CTX: 202K
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks.
► Reasoning60
► Tool Use50
#023GPT-4o-miniHP 30
⬡ STEELOPENAI
CTX: 128K
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs.
#024Gemini 3.5 FlashHP 210
⚡ ELECTRICGOOGLE
CTX: 1M
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed.
#025GPT-5.4HP 210
⬡ STEELOPENAI
CTX: 1M
GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system.
#026Gemini 3.1 Flash LiteHP 210
⚡ ELECTRICGOOGLE
CTX: 1M
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads.
#027Nemotron 3 SuperHP 200
▬ GROUNDNVIDIA
CTX: 1M
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications.
► Reasoning60
► Tool Use50
#028Nex-N2-ProHP 50
◎ POISONNEX AGI
CTX: 262K
Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters out of 397B total.
#029Gemma 4 26B A4B HP 50
⚡ ELECTRICGOOGLE
CTX: 262K
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind.
#030Claude Opus 4.6HP 200
◈ PSYCHICANTHROPIC
CTX: 1M
Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks.
#031Claude Haiku 4.5HP 40
◈ PSYCHICANTHROPIC
CTX: 200K
Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models.
#032GPT-5 MiniHP 80
⬡ STEELOPENAI
CTX: 400K
GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks.
#033Mistral NemoHP 30
◇ FLYINGMISTRALAI
CTX: 131K
A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA.
► Tool Use50
► Structured Out30
#034Gemini 3.1 Pro PreviewHP 210
⚡ ELECTRICGOOGLE
CTX: 1M
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliabil…
#035MiniMax M2.7HP 40
◌ GHOSTMINIMAX
CTX: 204K
MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement.
► Reasoning60
► Tool Use50
#036Gemma 4 31BHP 50
⚡ ELECTRICGOOGLE
CTX: 262K
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output.
#037GPT-5.4 MiniHP 80
⬡ STEELOPENAI
CTX: 400K
GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads.
#038Kimi K2.7 CodeHP 50
✦ FAIRYMOONSHOTAI
CTX: 262K
MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts.
#039GLM 5HP 40
◼ DARKZ AI
CTX: 202K
GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows.
► Reasoning60
► Tool Use50
#040Gemini 3.1 Flash Lite PreviewHP 210
⚡ ELECTRICGOOGLE
CTX: 1M
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.
#041Qwen3.7 PlusHP 200
◆ FIREQWEN
CTX: 1M
Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series.
#042Qwen3.7 MaxHP 200
◆ FIREQWEN
CTX: 1M
Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series.
► Reasoning60
► Tool Use50
#043Kimi K2.5HP 50
✦ FAIRYMOONSHOTAI
CTX: 262K
Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm.
#044gpt-oss-20bHP 30
⬡ STEELOPENAI
CTX: 131K
gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license.
► Reasoning60
► Tool Use50
#045Qwen3 Embedding 8BHP 50
◆ FIREQWEN
CTX: —
EMBEDDING / SPECIAL MODEL.
#046GPT-5.4 NanoHP 80
⬡ STEELOPENAI
CTX: 400K
GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks.
#047Claude Sonnet 4.5HP 200
◈ PSYCHICANTHROPIC
CTX: 1M
Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows.
#048Laguna XS.2HP 50
✱ ICEPOOLSIDE
CTX: 262K
Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai/), their efficient coding agent series.
► Reasoning60
► Tool Use50
#049Qwen3 Next 80B A3B InstructHP 50
◆ FIREQWEN
CTX: 262K
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces.
► Tool Use50
► Structured Out30
#050GPT-4.1 MiniHP 210
⬡ STEELOPENAI
CTX: 1M
GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost.