v3.0 โ€” Live APIs Active

One API. 6 Models.
70% Cheaper.

Route AI queries through Claude, GPT, Gemini โ€” automatically picking the cheapest model that meets quality requirements. OpenAI-compatible endpoint.

$0.003
Avg cost/query
6
Models
70%
Cost savings
3
Providers

Smart Routing, Not Just Proxying

Omnisphere analyzes your query complexity and routes to the optimal model automatically.

โšก

Cost-Aware Routing

Simple queries โ†’ Gemini Flash ($0.0001). Complex reasoning โ†’ Claude Sonnet ($0.01). You always get the best price for your needs.

๐Ÿ”—

OpenAI-Compatible

Drop-in replacement for /v1/chat/completions. Change one URL, save 70%. Works with any OpenAI SDK.

๐Ÿ”€

Multi-Model Consensus

Query 3 models simultaneously. Get weighted synthesis. Higher quality than any single model.

๐Ÿ“Š

Real-Time Metrics

Track cost per model, latency, tokens, success rates. SQLite-backed analytics built in.

๐Ÿ›ก๏ธ

Automatic Fallback

If Claude is down, route to GPT. If GPT is rate-limited, fall to Gemini. Zero downtime.

๐Ÿ’ฐ

Arbitrage Engine

Three pricing tiers: cheap, balanced, premium. The router picks the tier based on query complexity.

Pricing Tiers

Pay per query. No subscriptions. No minimums.

Cheap Tier

$0.0001
per query (avg)
  • Gemini 2.0 Flash
  • GPT-4o-mini
  • Claude Haiku 3.5
  • Best for: Simple Q&A, summaries
  • ~10,000 queries for $1

Premium Tier

$0.01
per query (avg)
  • Claude Sonnet 4
  • GPT-4o
  • Gemini 2.5 Pro
  • Best for: Research, strategy, complex
  • ~100 queries for $1

API โ€” Drop-in OpenAI Replacement

Change one line of code. Save 70%. Get your API key instantly.

# 1. Get your API key (free tier: 60 req/min, $1/mo budget)
# Contact: moltbot@ceooftheuniverse.com

# 2. Use it exactly like OpenAI:
curl https://your-omnisphere-url/v1/chat/completions \
  -H "Authorization: Bearer omni_your_key_here" \
  -H "Content-Type: application/json" \
  -d '{"model":"claude-sonnet-4","messages":[{"role":"user","content":"Hello"}]}'

# Smart routing โ€” auto-picks cheapest model:
curl -X POST /api/query \
  -H "Authorization: Bearer omni_xxx" \
  -d '{"prompt":"What is 2+2?","tier":"ultraCheap"}'

# Multi-model consensus:
curl -X POST /api/consensus \
  -H "Authorization: Bearer omni_xxx" \
  -d '{"prompt":"Analyze market trends","tier":"premium"}'
๐Ÿ”‘

API Key Tiers

Free: 60 req/min, $1/mo budget
Pro: 120 req/min, $25/mo
Enterprise: 300 req/min, $50/mo+
Per-query billing, no waste.

๐Ÿ“Š

Usage Dashboard

Track spend per model, per day. Real-time cost alerts. Monthly budget caps prevent surprise bills. Admin API for programmatic access.

๐Ÿ›ก๏ธ

Rate Limiting

Per-key rate limits. Budget enforcement. Automatic fallback when one provider is rate-limited. Zero-config resilience.