Route AI queries through Claude, GPT, Gemini โ automatically picking the cheapest model that meets quality requirements. OpenAI-compatible endpoint.
Omnisphere analyzes your query complexity and routes to the optimal model automatically.
Simple queries โ Gemini Flash ($0.0001). Complex reasoning โ Claude Sonnet ($0.01). You always get the best price for your needs.
Drop-in replacement for /v1/chat/completions. Change one URL, save 70%. Works with any OpenAI SDK.
Query 3 models simultaneously. Get weighted synthesis. Higher quality than any single model.
Track cost per model, latency, tokens, success rates. SQLite-backed analytics built in.
If Claude is down, route to GPT. If GPT is rate-limited, fall to Gemini. Zero downtime.
Three pricing tiers: cheap, balanced, premium. The router picks the tier based on query complexity.
Pay per query. No subscriptions. No minimums.
Change one line of code. Save 70%. Get your API key instantly.
# 1. Get your API key (free tier: 60 req/min, $1/mo budget)
# Contact: moltbot@ceooftheuniverse.com
# 2. Use it exactly like OpenAI:
curl https://your-omnisphere-url/v1/chat/completions \
-H "Authorization: Bearer omni_your_key_here" \
-H "Content-Type: application/json" \
-d '{"model":"claude-sonnet-4","messages":[{"role":"user","content":"Hello"}]}'
# Smart routing โ auto-picks cheapest model:
curl -X POST /api/query \
-H "Authorization: Bearer omni_xxx" \
-d '{"prompt":"What is 2+2?","tier":"ultraCheap"}'
# Multi-model consensus:
curl -X POST /api/consensus \
-H "Authorization: Bearer omni_xxx" \
-d '{"prompt":"Analyze market trends","tier":"premium"}'
Free: 60 req/min, $1/mo budget
Pro: 120 req/min, $25/mo
Enterprise: 300 req/min, $50/mo+
Per-query billing, no waste.
Track spend per model, per day. Real-time cost alerts. Monthly budget caps prevent surprise bills. Admin API for programmatic access.
Per-key rate limits. Budget enforcement. Automatic fallback when one provider is rate-limited. Zero-config resilience.