Using prxy.monster with Cline
Cline (formerly Claude Dev, also forked as Roo Code) is a VS Code extension for autonomous coding agents. It supports custom OpenAI-compatible endpoints via its provider settings.
Configure
- Open VS Code → Cline extension → Settings (gear icon)
- Under API Provider, select OpenAI Compatible
- Set:
- Base URL:
https://api.prxy.monster/v1 - API Key:
prxy_live_xxxxxxxxxxxxxxxxxxxxxxxx - Model ID:
claude-sonnet-4-6(orgpt-4o,gemini-2.0-flash, etc.)
- Base URL:
That’s it. Click “Done” and start a new task. Cline routes every request through prxy.monster.
Cline can also use Anthropic as a direct provider. If you select that, set the Anthropic Base URL to https://api.prxy.monster instead. Same effect.
Code change
None — Cline is a VS Code extension.
Verify
curl https://api.prxy.monster/healthStart a Cline task with a simple prompt — successful response confirms routing.
What you get
Cline runs long autonomous agentic tasks, often involving many tool calls and multi-step plans. This is exactly the workload prxy.monster is built for:
- MCP optimization — Cline can use MCP servers; the
mcp-optimizermodule prunes irrelevant tool defs per request, saving major tokens. - Pattern memory — Cline’s repeated workflows (“write a test for this”, “implement this PRD step”, “fix this lint error”) get learned across tasks.
- Infinite context — long autonomous tasks stop hitting the context wall via the
ipcmodule. - Cost guards — Cline can rack up serious cost on big tasks. The
cost-guardmodule enforces a hard ceiling. - Semantic cache — Cline often re-reads / re-analyzes the same files; cached responses save tokens.
Recommended pipeline
PRXY_PIPE=mcp-optimizer,semantic-cache,patterns,ipc,cost-guardThe cost-guard is especially important here — Cline can spend $5-20 per non-trivial task without one.
Common issues
- “Cline’s request format” / “context size errors” — Cline ships large system prompts. The
ipcmodule helps but can’t shrink an explicit system prompt mid-conversation. Make sure you’ve selected a model with a large enough context window. - Streaming — Cline uses streaming. Pass-through.
- Tool use — Cline’s file-edit / shell-execute tools are client-side; the LLM just emits tool-call payloads. prxy.monster sees those and the
mcp-optimizerdoesn’t touch them (it only prunes MCP tools, not Cline’s built-ins).
Roo Code
Roo Code is a Cline fork with extra features. The settings layout is the same — same OpenAI Compatible / Anthropic Base URL fields.
Full example
GUI-only setup, no example repo needed. The screenshots-style walkthrough above is the entire integration.
Verify the current settings layout with the Cline docs — VS Code extension UIs evolve. The OpenAI Compatible / Custom Base URL pattern has been stable.