Changelog
v1.0 — early access (Q2 2026)
Modules — production
- mcp-optimizer: embed tool descriptions, drop irrelevant ones. 88% token reduction typical.
- exact-cache: hash-based identical-request cache.
- semantic-cache: embedding-similarity cache.
- cost-guard: per-request, per-day, per-month USD caps.
- patterns: persistent memory; injects relevant patterns, forges new ones from outcomes.
- ipc: Infinite Persistent Context; compresses old turns instead of dropping them.
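As a rough illustration of what "hash-based identical-request cache" can mean, the sketch below derives a cache key by hashing a canonical serialization of the request. It is not the exact-cache module's actual code: the names `stableStringify`, `cacheKey`, and `ExactCache`, the choice of SHA-256, and the key-sorting step are all assumptions made for the example.

```typescript
import { createHash } from "node:crypto";

// Serialize a JSON-like value with object keys sorted, so two requests that
// differ only in key order produce the same string (hypothetical helper).
function stableStringify(value: unknown): string {
  if (Array.isArray(value)) {
    return "[" + value.map(stableStringify).join(",") + "]";
  }
  if (value !== null && typeof value === "object") {
    const obj = value as Record<string, unknown>;
    const parts = Object.keys(obj)
      .sort()
      .map((k) => JSON.stringify(k) + ":" + stableStringify(obj[k]));
    return "{" + parts.join(",") + "}";
  }
  // JSON.stringify returns undefined for undefined input; fall back to "null".
  return JSON.stringify(value) ?? "null";
}

// Hash the canonical form; SHA-256 is an assumed choice, not a documented one.
function cacheKey(request: unknown): string {
  return createHash("sha256").update(stableStringify(request)).digest("hex");
}

// Minimal in-memory cache keyed by the request hash.
class ExactCache<V> {
  private store = new Map<string, V>();

  get(request: unknown): V | undefined {
    return this.store.get(cacheKey(request));
  }

  set(request: unknown, response: V): void {
    this.store.set(cacheKey(request), response);
  }
}
```

Sorting keys before hashing means semantically identical request bodies hit the same entry; any change to a message, model, or parameter yields a different key and a cache miss.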
Storage
- Cloud: Postgres + pgvector + Upstash Redis + R2.
- Local: SQLite + sqlite-vec (optional) + filesystem blob.
- Same StorageAdapter interface across both.
API surface
- POST /v1/messages: Anthropic-compatible.
- POST /v1/chat/completions: OpenAI-compatible.
- GET /v1/pipeline: inspect the active module pipeline.
- /v1/keys: API key CRUD.
- /v1/billing/*: Stripe Checkout, Customer Portal, balance lookup.
- GET /health: liveness + provider config view.
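Because POST /v1/messages is described as Anthropic-compatible, a client call would presumably look like a standard Anthropic Messages request pointed at a different host. The sketch below only builds such a request. The helper name `buildMessagesRequest`, the base URL, and the use of the `x-api-key` header for the proxy's own key are assumptions; the body fields and the `anthropic-version` header follow Anthropic's public Messages API.

```typescript
interface ChatMessage {
  role: "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Build (but do not send) an Anthropic-style Messages request.
// baseUrl and apiKey are placeholders supplied by the caller.
function buildMessagesRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  messages: ChatMessage[],
  maxTokens = 1024,
): PreparedRequest {
  return {
    url: new URL("/v1/messages", baseUrl).toString(),
    init: {
      method: "POST",
      headers: {
        "content-type": "application/json",
        // Assumption: the proxy accepts its key in the same header Anthropic uses.
        "x-api-key": apiKey,
        "anthropic-version": "2023-06-01",
      },
      body: JSON.stringify({ model, max_tokens: maxTokens, messages }),
    },
  };
}
```

The result can be passed straight to `fetch(url, init)`. Keeping request construction separate from sending makes it easy to inspect exactly what would go over the wire.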
SDK
- @prxy/module-sdk: Module interface, StorageAdapter interface, helper types.
- @prxy/shared-types: canonical request/response types.
Local mode
- Single Docker image: prxymonster/local.
- Free, forever.
- Auth bypassed, no billing, no telemetry.
Auth
- argon2id-hashed API keys.
- 5-minute LRU cache for hot keys.
- Email/password auth (Clerk integration deferred).
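Verifying an argon2id hash is deliberately slow, which is presumably why hot keys are cached for five minutes. The sketch below shows one way a bounded LRU cache with a time-to-live could work. The class name `TtlLruCache`, the injectable clock, and the eviction details are assumptions for illustration, not the project's implementation.

```typescript
interface Entry<V> {
  value: V;
  expiresAt: number; // epoch milliseconds
}

// Bounded cache: evicts the least recently used entry when full and treats
// entries older than ttlMs as missing. Assumes maxEntries >= 1.
class TtlLruCache<V> {
  private entries = new Map<string, Entry<V>>();
  private maxEntries: number;
  private ttlMs: number;
  private now: () => number;

  constructor(
    maxEntries: number,
    ttlMs: number = 5 * 60 * 1000, // 5 minutes, matching the changelog
    now: () => number = () => Date.now(),
  ) {
    this.maxEntries = maxEntries;
    this.ttlMs = ttlMs;
    this.now = now;
  }

  get(key: string): V | undefined {
    const entry = this.entries.get(key);
    if (entry === undefined) return undefined;
    if (entry.expiresAt <= this.now()) {
      this.entries.delete(key); // expired: drop and report a miss
      return undefined;
    }
    // Map iterates in insertion order, so re-inserting marks the key as
    // most recently used. The original expiry is kept.
    this.entries.delete(key);
    this.entries.set(key, entry);
    return entry.value;
  }

  set(key: string, value: V): void {
    this.entries.delete(key);
    if (this.entries.size >= this.maxEntries) {
      const oldest = this.entries.keys().next().value;
      if (oldest !== undefined) this.entries.delete(oldest);
    }
    this.entries.set(key, { value, expiresAt: this.now() + this.ttlMs });
  }
}
```

In an auth path, the cached value would typically be the verified key record, so that a repeat request within the TTL skips the argon2id verification. How the cache key is derived from the presented API key is left open here.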
Billing
- Stripe Checkout + Customer Portal.
- Per-tier monthly quota + auto-recharge.
- BYOK pricing model.
v1.1 — coming next quarter
New modules
- router: fallback chains + smart model selection.
- rehydrator: pull archived context back when referenced.
- compaction-bridge: survive Claude Code auto-compaction.
- prompt-optimizer: restructure prompts for max provider cache hits.
- tool-cache: cache MCP tool result calls.
- guardrails: PII redaction, profanity blocks, custom regex.
- airgap: enforce zero non-provider outbound network calls.
Other
- /v1/export and /v1/import for migrating between local and cloud.
- npm-package module loading (load custom modules without a local file path).
- Open-source dashboard package for local mode.
- Post-hook support for streaming responses.
- Lua INCR-based rate limiter (replaces naive get/set).
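The motivation for the Lua INCR-based rate limiter is atomicity. A naive get-then-set counter lets two concurrent requests read the same value and both pass, while a Lua script runs as a single step inside Redis. The sketch below shows the common fixed-window pattern together with an in-memory stand-in that mirrors its semantics. The script text, `InMemoryFixedWindow`, and `allow` are illustrative assumptions, not the planned implementation.

```typescript
// Common fixed-window pattern: increment the counter and set its expiry on
// the first hit of a window. KEYS[1] is the counter key, ARGV[1] is the
// window length in seconds. (Assumed script, shown for illustration.)
const RATE_LIMIT_LUA = `
local current = redis.call('INCR', KEYS[1])
if current == 1 then
  redis.call('EXPIRE', KEYS[1], ARGV[1])
end
return current
`;

// In-memory stand-in with the same semantics, so the logic can be exercised
// without a Redis server.
class InMemoryFixedWindow {
  private counters = new Map<string, { count: number; expiresAt: number }>();
  private now: () => number;

  constructor(now: () => number = () => Date.now()) {
    this.now = now;
  }

  // Mirrors the script: start a new window if none is live, then increment
  // and return the count for the current window.
  incr(key: string, windowSeconds: number): number {
    const t = this.now();
    let entry = this.counters.get(key);
    if (entry === undefined || entry.expiresAt <= t) {
      entry = { count: 0, expiresAt: t + windowSeconds * 1000 };
      this.counters.set(key, entry);
    }
    entry.count += 1;
    return entry.count;
  }
}

// A request is allowed while the window's count stays within the limit.
function allow(
  limiter: InMemoryFixedWindow,
  key: string,
  limit: number,
  windowSeconds: number,
): boolean {
  return limiter.incr(key, windowSeconds) <= limit;
}
```

Against a real Redis instance, the script would be sent with EVAL (or cached with SCRIPT LOAD and invoked with EVALSHA), and the returned count compared with the limit in the same way.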
v1.2 — planned
- evals module: A/B test prompts and modules.
- replay module: replay historical traffic against new pipelines.
- branch module: fork conversations, compare outcomes.
- Hybrid local-cloud sync.
- pattern-sync module: sync patterns across devices.
v2.0 — long-term
- Q-learning router: learn which model best handles which query type.
- collective module: opt-in anonymized cross-user pattern sharing.
- Multi-region cloud deployments.
- SOC 2 + HIPAA compliance certifications.