On this page
What it does Who it's for Getting started

What is Clawzempic?

A drop-in LLM proxy that reduces costs 70-95% while adding memory, security, and observability — without changing your application code.

You point your SDK at our base URL, and every request flows through an optimization pipeline that reduces costs 70-95% while adding memory, security, and observability — without changing your application code.

It works with OpenRouter (300+ models) and Anthropic (Claude) API keys.

What it does

When a request hits Clawzempic, it passes through several optimization layers before reaching the upstream provider:

  1. Routing — Scores the complexity of each request and routes simple tasks to cheaper models
  2. Caching — Injects prompt cache breakpoints so repeated context is served at 90% discount
  3. Windowing — Compresses long conversations by summarizing old messages
  4. Memory — Injects persistent facts and recent activity into the system prompt
  5. Security — Scans tool results for injection attacks and redacts credentials
  6. Scripts — Intercepts trivial queries (math, time) and responds instantly at zero cost

Your bot gets smarter and cheaper. Your code stays the same.

Who it's for

  • Bot builders running Claude or OpenRouter-powered agents who want to cut costs without sacrificing quality
  • Teams who need persistent memory across conversations and channels
  • Anyone tired of paying Opus prices for "what time is it?"

Getting started

The fastest path is a single command:

bash
npx clawzempic

This walks you through account creation, key storage, and framework detection. See the Quickstart for the full walkthrough.