model: "auto", PRYSM picks the best-value model
for each prompt; pass any id below to pin a single request to that model.
Prices are USD per million tokens (MTok). This page mirrors the live catalog —
fetch the current data any time from
GET /v1/models or
prysm models.Budget
Fast and inexpensive — ideal for classification, simple Q&A, and high-volume work.| ID | Model | Provider | Context | Input | Output | Strengths |
|---|---|---|---|---|---|---|
mistral-nemo | Mistral Nemo | mistral | 128K | $0.02 | $0.02 | classification, simple, fast |
gpt-5-nano | GPT-5 Nano | openai | 128K | $0.05 | $0.40 | simple, chat, fast |
gemini-2.5-flash-lite | Gemini 2.5 Flash-Lite | 1M | $0.10 | $0.40 | simple, translate, multimodal | |
llama-4-maverick | Llama 4 Maverick | meta | 128K | $0.27 | $0.85 | general, multilingual, open-source |
deepseek-v3.2 | DeepSeek V3.2 | deepseek | 128K | $0.28 | $0.42 | general, code, multilingual, value |
Mid
The everyday workhorses — strong quality at a sensible price across most tasks.| ID | Model | Provider | Context | Input | Output | Strengths |
|---|---|---|---|---|---|---|
grok-4.1 | Grok 4.1 | xai | 2M | $0.20 | $0.50 | realtime, news, long-context |
gpt-5-mini | GPT-5 Mini | openai | 128K | $0.25 | $2.00 | structured, json, tool-use |
qwen-2.5-72b | Qwen 2.5 72B | alibaba | 128K | $0.30 | $0.80 | math, chinese, multilingual, code |
kimi-k2 | Kimi K2 | moonshot | 128K | $0.35 | $1.50 | long-context, chinese, analysis |
mistral-medium-3 | Mistral Medium 3 | mistral | 131K | $0.40 | $2.00 | multilingual, european, code |
gemini-3-flash | Gemini 3 Flash | 1M | $0.50 | $3.00 | multimodal, fast, long-context | |
claude-haiku-4.5 | Claude Haiku 4.5 | anthropic | 200K | $1.00 | $5.00 | instruction, safety, chat, fast |
sonar | Sonar | perplexity | 128K | $1.00 | $1.00 | realtime, news, search, citations |
Premium
High-accuracy reasoning, writing, and code for higher-stakes work.| ID | Model | Provider | Context | Input | Output | Strengths |
|---|---|---|---|---|---|---|
deepseek-r1 | DeepSeek R1 | deepseek | 128K | $0.55 | $2.19 | reasoning, math, chain-of-thought |
gpt-5.2 | GPT-5.2 | openai | 128K | $1.75 | $14.00 | reasoning, agentic, structured, code |
gemini-3.1-pro | Gemini 3.1 Pro | 1M | $2.00 | $12.00 | multimodal, long-context, reasoning | |
claude-sonnet-4.5 | Claude Sonnet 4.5 | anthropic | 1M | $3.00 | $15.00 | writing, code, nuance, instruction |
sonar-pro | Sonar Pro | perplexity | 200K | $3.00 | $15.00 | realtime, news, search, citations, deep-search |
Frontier
Maximum capability for the hardest problems — reserved for when accuracy is paramount.| ID | Model | Provider | Context | Input | Output | Strengths |
|---|---|---|---|---|---|---|
grok-4.1-heavy | Grok 4.1 Heavy | xai | 2M | $3.00 | $15.00 | deep-reasoning, realtime, science |
claude-opus-4.6 | Claude Opus 4.6 | anthropic | 1M | $5.00 | $25.00 | complex-reasoning, writing, safety |
gpt-5.2-pro | GPT-5.2 Pro | openai | 128K | $21.00 | $168.00 | advanced-reasoning, science, legal |
Using a specific model
Pass anid from the tables above instead of auto to skip routing: