MCP server that cuts 65% of tokens by compressing markdown files and activating caveman speak. No file distribution. No sync. No API key. Register once — works everywhere.
or run without installing: uvx caveman-mcp
01 // Why MCP
Caveman prompts used to require a file in every project. MCP changes everything.
Add caveman-mcp to your client config once. It activates across all your projects, forever. No per-repo setup.
The original caveman needed a prompt file in every repo. No more copy-paste. No stale prompts. No drift.
Zero credentials. Zero auth friction. Runs as a local stdio server — your data stays on your machine.
MCP exposes compress tools directly to the agent. compress_prepare → compress_write → compress_restore. The agent does the work.
Works with Claude Code, Cursor, Windsurf, Cline, and any MCP-compatible host. One install, five agents.
Wire up a Claude Code PostToolUse hook and every markdown file gets compressed automatically on read. Invisible. Zero effort.
02 // Compression Matrix
Payload: React Re-render Diagnosis
Same fix. 75% fewer tokens.
Average 65% reduction. Reasoning tokens unaffected. Output payload only.
03 // Intensity Levels
Six modes from terse to ancient. Pick your poison.
| Mode | Effect |
|---|---|
| lite | Drop filler. Keep full sentences and articles. |
| full | Default — drop articles, fragments OK, short synonyms. |
| ultra | Abbreviate (DB/auth/req/res/fn), strip conjunctions, X→Y causality. |
| wenyan-lite | Semi-classical Chinese register. |
| wenyan-full | Full 文言文 — 80–90% character reduction. |
| wenyan-ultra | Extreme. Ancient scholar feel. |
Available Prompts
Activate caveman compression
Terse commit message style
One-line code review comments
Quick-reference card
04 // Initialize Protocol
Claude Code — global config (recommended):
Or add via CLI:
Activate in any session: