Plexus

A universal LLM API gateway and transformation layer.

Discord Community | API Reference | Configuration | Installation | Testing

Plexus sits in front of your LLM providers and exposes one consistent API surface for OpenAI, Anthropic, Gemini, OpenAI-compatible providers, OAuth-backed subscriptions, MCP servers, and more. It handles protocol translation, routing, failover, usage tracking, and provider-specific quirks so clients can switch models without rewriting code.

Highlights

Unified API surface for OpenAI Chat Completions, OpenAI Responses, Anthropic Messages, Gemini, embeddings, audio, and images.
Provider routing and load balancing across OpenAI, Anthropic, Google Gemini, DeepSeek, Groq, OpenRouter, and any OpenAI-compatible backend.
OAuth-backed providers for GitHub Copilot, Anthropic Claude, OpenAI Codex, Gemini CLI, and Antigravity through the Admin UI.
Model aliases that map virtual model names to one or more real provider targets using random, in_order, cost, performance, latency, or e2e_performance selectors.
Vision fallthrough that describes images with a vision-capable descriptor model before routing to non-vision models.
Automatic failover with exponential provider cooldowns and optional stall detection for slow or stuck streams.
Usage, quota, and cost controls with per-request logs, token counts, latency, TPS, and per-API-key limits.
Admin dashboard for configuration, analytics, debug traces, provider health, and quota monitoring.
MCP proxying with isolated per-request sessions for streamable HTTP MCP servers.
Encryption at rest for API keys, OAuth tokens, provider secrets, and MCP headers.

Quick Start

ADMIN_KEY is required for the dashboard and management API. DATABASE_URL is optional and defaults to SQLite at ./data/plexus.db; use a PostgreSQL connection string for production.

Docker

docker run -p 4000:4000 \
  -v plexus-data:/app/data \
  -e ADMIN_KEY="your-admin-password" \
  -e ENCRYPTION_KEY="your-generated-hex-key" \
  ghcr.io/mcowger/plexus:latest

Standalone Binary

Download a pre-built binary from GitHub Releases:

# macOS Apple Silicon
curl -L https://github.com/mcowger/plexus/releases/latest/download/plexus-macos -o plexus
chmod +x plexus
ADMIN_KEY="your-admin-password" ./plexus

# Linux x64
curl -L https://github.com/mcowger/plexus/releases/latest/download/plexus-linux -o plexus
chmod +x plexus
ADMIN_KEY="your-admin-password" ./plexus

# Windows x64
Invoke-WebRequest -Uri "https://github.com/mcowger/plexus/releases/latest/download/plexus.exe" -OutFile "plexus.exe"
$env:ADMIN_KEY = "your-admin-password"
$env:DATABASE_URL = "sqlite://./data/plexus.db"
.\plexus.exe

The binary is self-contained; database migrations and the web dashboard are embedded. See Installation for Docker Compose, Windows troubleshooting, source builds, and environment variables.

Try It

Open the dashboard at http://localhost:4000, then create/configure an API key and model alias. Send a request:

curl -X POST http://localhost:4000/v1/chat/completions \
  -H "Authorization: Bearer sk-plexus-my-key" \
  -H "Content-Type: application/json" \
  -d '{"model": "fast", "messages": [{"role": "user", "content": "Hello!"}]}'

OAuth providers are configured in the Admin UI. See Configuration: OAuth Providers.

Screenshots


Dashboard — Request volume, token usage, cost trends, and top models.	Providers — Configured providers with status, quota indicators, and controls.

Request Logs — Per-request model, provider, tokens, cost, latency, and live stream throughput.	Model Aliases — Virtual model names, targets, selectors, and routing priorities.

Feature Notes

Protocol Translation

Plexus accepts OpenAI Chat Completions (/v1/chat/completions), OpenAI Responses (/v1/responses), Anthropic Messages (/v1/messages), Gemini native requests, and OpenAI-compatible provider formats. Requests can be translated between providers in both directions, including streaming and tool use. See the API Reference.

Routing

Model aliases can target one or more providers and choose targets by randomness, order, cost, measured performance, latency, or end-to-end performance. priority: api_match prefers providers that natively speak the incoming API format. See Configuration: models.

Vision Fallthrough

Vision fallthrough lets image requests work with non-vision target models. Plexus sends images to a descriptor model, inserts the generated descriptions into the request, and routes the transformed request to the configured target. Enable it per model alias in the Admin UI and configure the descriptor model in settings.

Quotas, Cooldowns, and Stream Safety

Per-key quotas can limit tokens, requests, or cost across rolling, daily, or weekly windows. Failed providers are automatically cooled down with exponential backoff, and stream protection can cancel upstream requests on client disconnect, timeout stalled providers, and show live throughput in request logs. See Configuration.

MCP Proxy

Plexus can proxy streamable HTTP Model Context Protocol servers with isolated sessions per request. See Configuration: MCP Servers.

Encryption

Set ENCRYPTION_KEY to enable AES-256-GCM encryption for sensitive database fields:

export ENCRYPTION_KEY="$(openssl rand -hex 32)"

Existing plaintext values are encrypted on first startup with a key. See Configuration: Encryption.

Admin CLI

Pass a subcommand as the first argument to the binary or bun run src/index.ts:

rekey decrypts sensitive fields with the current ENCRYPTION_KEY and re-encrypts them with NEW_ENCRYPTION_KEY.
migrate-quota-snapshots copies legacy quota_snapshots rows into meter_snapshots; it is idempotent and safe to rerun.

ENCRYPTION_KEY="<current-key>" NEW_ENCRYPTION_KEY="<new-key>" ./plexus rekey
DATABASE_URL=sqlite://./data/plexus.db ./plexus migrate-quota-snapshots

Development

bun run setup:hooks
bun run test

bun test is intentionally blocked; use bun run test. See Testing.

License

MIT License — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 1,500 Commits
.agents/skills		.agents/skills
.config		.config
.github		.github
docs		docs
packages		packages
scripts		scripts
test/bun-test-guard		test/bun-test-guard
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.vscodeignore		.vscodeignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
biome.json		biome.json
bun.lock		bun.lock
bunfig.toml		bunfig.toml
docker-compose.yml		docker-compose.yml
lefthook.yml		lefthook.yml
mise.toml		mise.toml
package.json		package.json
paseo.json		paseo.json
redocly.yaml		redocly.yaml
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Plexus

Discord Community | API Reference | Configuration | Installation | Testing

Highlights

Quick Start

Docker

Standalone Binary

Try It

Screenshots

Feature Notes

Protocol Translation

Routing

Vision Fallthrough

Quotas, Cooldowns, and Stream Safety

MCP Proxy

Encryption

Admin CLI

Development

License

About

Uh oh!

Releases 443

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Plexus

Discord Community | API Reference | Configuration | Installation | Testing

Highlights

Quick Start

Docker

Standalone Binary

Try It

Screenshots

Feature Notes

Protocol Translation

Routing

Vision Fallthrough

Quotas, Cooldowns, and Stream Safety

MCP Proxy

Encryption

Admin CLI

Development

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 443

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages