Drop-in for OpenAI
Change one line — the base URL. Every OpenAI SDK call works unchanged.
Smart routing
Intent classification picks the right model for each prompt: quality, balanced, or agility.
BRAIN.md config
A declarative, version-controlled routing file — the
.cursorrules of model routing.Cost guardrails
AgentGuard caps per-request spend and blocks models you don’t want agents touching.
PrysmProof receipts
Every response carries a tamper-evident SHA-256 receipt of what ran and why.
MCP server
Give Claude Desktop, Cursor, or Windsurf cost-aware routing as a native tool.
Why PRYSM
The model you reach for by default is rarely the best one for the task in front of you. A frontier model on a one-line classification is wasted money; a budget model on a nuanced contract is wasted quality. PRYSM makes that decision per request — so you get the right model every time without hand-tuning, while spend stays predictable.Better outputs
Each prompt goes to the model that’s actually best at it — code, writing, math,
translation, reasoning, and more.
Lower cost
Cheap models handle the easy 80%; premium models are reserved for the hard 20%.
Typical savings are large versus an all-premium baseline.
Zero lock-in
OpenAI-compatible. No new SDK to learn, no rewrite, no juggling nine provider keys.
The drop-in pattern
PRYSM speaks the OpenAI API. Setmodel: "auto" and let PRYSM choose:
What’s in every response
PRYSM returns a standard OpenAI payload plus a top-levelprysm block describing
the decision — which model ran, why, what it cost, latency, and a verifiable proof:
prysm field. The PRYSM SDKs expose it
cleanly through extension().
Next steps
Quickstart
Make your first routed request in under five minutes.
Authentication
API keys, the base URL, and environment variables.
How routing works
Modes, signals, and how PRYSM picks a model.
API reference
Every endpoint, parameter, and response field.