Pay less. Get more.
Reduce your LLM token costs by 54%. Works with Claude Code and Claude Desktop.
The problem every AI power user knows
Token costs are rising
Every session consumes thousands of tokens for context that's rarely needed.
Compacting kills quality
When context gets compressed, important details are lost.
Manual optimization doesn't scale
Manually trimming CLAUDE.md? For every project? That doesn't scale.
3 steps to lower token costs
Install the plugin
One command. Works with Claude Code (hooks) and Claude Desktop (MCP).
npm install -g @simosphere/tokenizer-speedup
Rules match your context
YAML-based rules load only the context your prompts need.
See your savings
Real-time dashboard shows token savings and ROI.
Integration into your tooling — step by step.
Claude Code
Fully supported
Automatic hook integration. The plugin injects optimized context with every prompt.
npm install -g @simosphere/tokenizer-speedup
tsu setup --api-key YOUR_KEY
What happens: With every prompt, the plugin loads only the relevant context. You see savings in the dashboard.
Claude Desktop
Available (v0.8.0)
MCP server for Claude Desktop. Optimizes context automatically with every prompt.
npm install -g @simosphere/tokenizer-speedup
{"mcpServers":{"tokenizer-speedup":{"command":"tsu-mcp"}}}
What happens: Claude Desktop can now optimize context, analyze tokens and show savings via MCP tools.
ChatGPT / OpenAI
Token analysis available
Local token analysis for OpenAI models. No live integration into ChatGPT sessions.
tsu analyze src/app.ts --provider openai --model gpt-4o
Counts tokens locally with the OpenAI tokenizer. Ideal for cost estimates before sending.
Configuration
Create a project-local tokenizer-speedup.yaml for custom rules.
tsu init
Defines keyword triggers, file patterns, and token limits. Each project can have its own rules.
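To make the idea of keyword triggers, file patterns, and token limits concrete, here is a minimal Python sketch of how such rules might select context. It is an illustration only: the `Rule` fields, the `select_context` function, and the 4-characters-per-token estimate are all assumptions, and the actual schema of tokenizer-speedup.yaml is not shown on this page.

```python
from dataclasses import dataclass
from fnmatch import fnmatch


@dataclass
class Rule:
    # All field names are hypothetical; they only mirror the three concepts
    # named above (keyword triggers, file patterns, token limits).
    keywords: list[str]
    file_patterns: list[str]
    token_limit: int


def estimate_tokens(text: str) -> int:
    # Rough heuristic (about 4 characters per token), not a real tokenizer.
    return max(1, len(text) // 4)


def select_context(prompt: str, files: dict[str, str], rules: list[Rule]) -> list[str]:
    """Return the file paths loaded for a prompt under the given rules."""
    prompt_lower = prompt.lower()
    selected: list[str] = []
    for rule in rules:
        # A rule fires only if one of its keywords appears in the prompt.
        if not any(k.lower() in prompt_lower for k in rule.keywords):
            continue
        budget = rule.token_limit
        for path, content in files.items():
            if path in selected:
                continue
            if not any(fnmatch(path, p) for p in rule.file_patterns):
                continue
            cost = estimate_tokens(content)
            # Skip files that would exceed the rule's remaining token budget.
            if cost <= budget:
                selected.append(path)
                budget -= cost
    return selected


files = {
    "src/auth/login.ts": "x" * 400,       # roughly 100 tokens
    "src/auth/session.ts": "x" * 800,     # roughly 200 tokens
    "src/billing/invoice.ts": "x" * 400,  # roughly 100 tokens
}
rules = [
    Rule(keywords=["login", "auth"], file_patterns=["src/auth/*"], token_limit=250),
    Rule(keywords=["invoice"], file_patterns=["src/billing/*"], token_limit=500),
]

print(select_context("Fix the login redirect bug", files, rules))
```

In this sketch the prompt mentions "login", so only the first rule fires. It loads src/auth/login.ts, then skips src/auth/session.ts because that file would exceed the remaining budget. The billing file is never considered.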
Your data works for you — in every scenario.
Daily Feature Development
You use Claude Code or Claude Desktop daily for new features. TokenizerSpeedUp loads only the files your prompt needs — not the entire repository.
Code Reviews in a Team
5 developers review PRs and refactor code with Claude Code and Claude Desktop. Context Gating via hooks and MCP prevents old branches and irrelevant modules from being loaded.
Multi-Project Token Management
15 developers work across multiple client projects. Session Intelligence compresses context across projects and delivers budget reports per team.
Simple pricing. Pay only when you save.
Free
- ✓ Up to 100K tokens saved/month
- ✓ Basic dashboard
- ✓ Default YAML rules
- ✓ Local metrics
Pro
Popular
- ✓ Unlimited token optimization
- ✓ Full analytics dashboard
- ✓ Custom YAML rules
- ✓ Priority support
10% of savings above 500K tokens
Start Pro
Team
- ✓ Everything in Pro
- ✓ Team-wide rule sync
- ✓ Per-seat billing
- ✓ Admin dashboard
8% of savings above 1M tokens
Contact Sales
54% token reduction in 2 weeks
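The usage-based fees in the Pro and Team tiers can be estimated with a short calculation. The sketch below rests on assumptions this page does not confirm: that the percentage applies to the monetary value of tokens saved beyond the threshold, and that this value uses a flat price per million tokens. The `usd_per_million_tokens` figure of 3.00 is purely hypothetical.

```python
def billable_tokens(tokens_saved: int, threshold: int) -> int:
    # Only savings above the tier threshold count toward the fee.
    return max(0, tokens_saved - threshold)


def monthly_fee(tokens_saved: int, threshold: int, share: float,
                usd_per_million_tokens: float) -> float:
    # Assumption: the fee is a share of the dollar value of the billable savings.
    value_usd = billable_tokens(tokens_saved, threshold) / 1_000_000 * usd_per_million_tokens
    return share * value_usd


# Pro: 10% of savings above 500K tokens (the price per million is hypothetical).
pro_fee = monthly_fee(2_000_000, threshold=500_000, share=0.10, usd_per_million_tokens=3.00)

# Team: 8% of savings above 1M tokens; below the threshold no fee is due.
team_fee = monthly_fee(800_000, threshold=1_000_000, share=0.08, usd_per_million_tokens=3.00)

print(f"Pro fee: ${pro_fee:.2f}, Team fee: ${team_fee:.2f}")
```

Under these assumptions, a Pro user who saves 2M tokens in a month is billed on 1.5M of them, and a Team account that stays below 1M saved tokens owes no usage fee.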
Privacy first. Your prompts never leave your machine.
Local
The plugin runs entirely on your machine.
Numbers only
Only token counters are transmitted. No prompts, no code.
Transparent
Only token counters are sent. You stay in full control.
Your Dashboard
Real-time insights into your token savings