Now supporting MCP & multi-modal routing

One gateway.
any model

Prism connects your stack to 350+ AI models across every modality — with intelligent routing, guardrails, RAG, and full observability. One API. Enterprise-grade.

350+
Models
99.99%
Uptime SLA
<50ms
P99 latency
prism.ts
// Intelligent routing across 350+ models
import { Prism } from '@prismgateway/sdk'

const prism = new Prism({
  apiKey: process.env.PRISM_API_KEY
})

// Route to the best model automatically
const response = await prism.chat({
  model: 'auto',
  messages: [{
    role: 'user',
    content: 'Summarize this document'
  }]
})

// ✓ Routed to claude-3-7-sonnet (best cost/quality)
// ✓ Guardrails applied — 0 PII detected
// ✓ Logged to observability dashboard
Routed in 12ms claude-3-7-sonnet

Trusted by teams building the next generation of AI products

Acme Corp
Meridian AI
Vertex Labs
Nova Systems
Crestline
Apex Data
Acme Corp
Meridian AI
Vertex Labs
Nova Systems
Crestline
Apex Data
350+
AI Models
10B+
Tokens/month
99.99%
Uptime SLA
<50ms
Added latency
The problem

Fragmented AI APIs are slowing your team down

Each provider has a different SDK, auth method, and error format

No unified billing or cost visibility across Claude, GPT, Gemini

Zero observability — you don't know what's failing or why

Compliance teams can't audit AI requests without custom tooling

The solution

One gateway for your entire AI stack

One OpenAI-compatible API for every model and modality

Unified billing dashboard with per-model cost breakdown

Full request tracing, latency analytics, and anomaly detection

Built-in guardrails, PII detection, and audit logs out of the box

See all features
Features

Everything your AI stack needs

Built for engineering teams that need reliability, control, and visibility — without sacrificing developer experience.

Smart Routing

Automatically route requests to the best model based on cost, latency, quality, and capability. Fall back gracefully on errors.

Guardrails

Enforce content policies, detect PII, block prompt injections, and ensure compliance — before responses reach your users.

Multi-modal

Text, images, audio, video, and embeddings — unified through one API. Switch modalities without rewriting your integration.

RAG & Retrieval

Connect your vector stores, knowledge bases, and document pipelines. Built-in chunking, embedding, and hybrid retrieval.

MCP Support

First-class Model Context Protocol support. Standardize how your agents and tools communicate across any model provider.

Observability

Full request tracing, cost analytics, latency percentiles, and anomaly detection. Know exactly what your AI spend is doing.

Developer-first

Works with your existing code

Prism is fully OpenAI-compatible. Change one line — baseURL — and you're done. No new SDK to learn, no migration cost.

Works with any OpenAI SDK (Python, TypeScript, Go, Ruby)

Zero code changes beyond one environment variable

Same request/response format you already know

Streaming, function calling, and embeddings all supported

Read the docs
prism-sdk
// Drop-in replacement — just change baseURL
import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://api.prismgateway.io/v1',
  apiKey: process.env.PRISM_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'auto',   // Prism picks the best model
  messages: [{ role: 'user', content: 'Summarize this doc' }]
})

// ✓ Routed → claude-3-7-sonnet  ✓ Guardrails applied  ✓ Cost: $0.002
Gateway online · p99 <50ms OpenAI-compatible
Pricing

Simple, transparent pricing

Start free. Scale as you grow. No hidden fees.

Monthly Annual Save 20%

Starter

For individuals and side projects

$0

No credit card required

Get started free
  • 5M tokens/month
  • 10 model providers
  • Basic smart routing
  • 7-day request logs
  • Community support
Most popular

Pro

For growing teams shipping AI products

$49 /month
Start 14-day trial
  • 100M tokens/month
  • All 350+ models
  • Smart routing + fallbacks
  • Guardrails & PII detection
  • Full observability
  • Cost analytics & budgets
  • Email + Slack support

Enterprise

For organizations with advanced needs

Custom
Talk to sales
  • Unlimited tokens
  • Custom model integrations
  • SSO / SAML
  • SOC 2 Type II
  • Audit logs (unlimited)
  • 99.99% SLA
  • Dedicated support engineer

Start building with every AI model today

Free tier includes 5M tokens/month. No credit card required. Production-ready in minutes.