Skip to Content
prxy.monster v1 is in early access. See what shipped →
Changelog

Changelog

v1.0 — early access (Q2 2026)

Modules — production

  • mcp-optimizer — embed tool descriptions, drop irrelevant ones. 88% token reduction typical.
  • exact-cache — hash-based identical-request cache.
  • semantic-cache — embedding-similarity cache.
  • cost-guard — per-request, per-day, per-month USD caps.
  • patterns — persistent memory; injects relevant patterns, forges new ones from outcomes.
  • ipc — Infinite Persistent Context; compresses old turns instead of dropping them.

Storage

  • Cloud: Postgres + pgvector + Upstash Redis + R2.
  • Local: SQLite + sqlite-vec (optional) + filesystem blob.
  • Same StorageAdapter interface across both.

API surface

  • POST /v1/messages — Anthropic-compatible.
  • POST /v1/chat/completions — OpenAI-compatible.
  • GET /v1/pipeline — inspect the active module pipeline.
  • /v1/keys — API key CRUD.
  • /v1/billing/* — Stripe Checkout, Customer Portal, balance lookup.
  • GET /health — liveness + provider config view.

SDK

  • @prxy/module-sdkModule interface, StorageAdapter interface, helper types.
  • @prxy/shared-types — canonical request/response types.

Local mode

  • Single Docker image: prxymonster/local.
  • Free, forever.
  • Bypassed auth, no billing, no telemetry.

Auth

  • argon2id-hashed API keys.
  • 5-minute LRU cache for hot keys.
  • Email/password auth (Clerk integration deferred).

Billing

  • Stripe Checkout + Customer Portal.
  • Per-tier monthly quota + auto-recharge.
  • BYOK pricing model.

v1.1 — coming next quarter

New modules

  • router — fallback chains + smart model selection.
  • rehydrator — pull archived context back when referenced.
  • compaction-bridge — survive Claude Code auto-compaction.
  • prompt-optimizer — restructure prompts for max provider cache hits.
  • tool-cache — cache MCP tool result calls.
  • guardrails — PII redaction, profanity blocks, custom regex.
  • airgap — enforce zero non-provider outbound network calls.

Other

  • /v1/export and /v1/import for migrating between local and cloud.
  • npm-package module loading (load custom modules without a local file path).
  • Open-source dashboard package for local mode.
  • Post-hook support for streaming responses.
  • Lua INCR-based rate limiter (replaces naive get/set).

v1.2 — planned

  • evals module — A/B test prompts and modules.
  • replay module — replay historical traffic against new pipelines.
  • branch module — fork conversations, compare outcomes.
  • Hybrid local-cloud sync.
  • pattern-sync module — sync patterns across devices.

v2.0 — long-term

  • Q-learning router — learn which model best handles which query type.
  • collective module — opt-in anonymized cross-user pattern sharing.
  • Multi-region cloud deployments.
  • SOC 2 + HIPAA compliance certifications.
Last updated on