RTK Review: Slash Your AI Coding Token Costs by 70% — Claude Code, Cursor & More

Have you ever noticed that while using Claude Code or Cursor, the model just wants a quick look at git status — but ends up consuming the entire terminal output? Every extra line you feed the context window costs real money on pay-per-token APIs.

RTK solves this. It’s a high-performance open-source tool written in Rust, acting as a CLI proxy between your terminal and the AI model. It filters, compresses, and deduplicates command output before it enters the context window. With over 19,000 GitHub stars, the project claims to reduce token consumption by 60-90% on common development workflows.

The Core Problem: Why Your AI Coding Is Burning Tokens

The actual cost in AI coding isn’t the command — it’s the command output. When you run tests, lints, build logs, directory trees, and git diffs repeatedly, most of the output is noise the model doesn’t need:

  • cargo test dumps 262 lines of “ok” — all identical
  • grep results repeat the same path prefix across dozens of matches
  • ls / tree recursive output occupies huge context space with little structural insight
  • git diff is 90% line numbers and context — the model only needs the changed lines

On pay-per-token setups (OpenAI, Anthropic, Gemini), this directly hits your bill. On flat-rate plans, it accelerates context exhaustion and forces session restarts.
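
To make the cost concrete, the gap between a command and its output is easy to simulate in plain shell (the file path and line content here are illustrative, not RTK output):

```shell
# The command itself is a handful of tokens; its output is what the model pays for.
# Simulate a test run whose 262 result lines are near-identical noise:
yes 'test suite::case ... ok' | head -n 262 > /tmp/test_output.txt

wc -l < /tmp/test_output.txt   # 262 lines the model would ingest verbatim
```

Every one of those lines enters the context window at full price, even though they carry a single bit of information: the tests passed.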

What Is RTK

RTK (Rust Token Killer) is a single-binary CLI tool that acts as an intermediate proxy — compressing command output before it reaches the model. Think of it as a middle layer between your terminal and the AI:

You → AI model → `git status` → RTK runs the actual command → compresses the output → model gets a clean result

Official benchmarks on a medium-sized TypeScript/Rust project in a 30-minute Claude Code session:

Operation | Frequency | Raw Tokens | With RTK | Saved
--- | --- | --- | --- | ---
ls / tree | 10x | 2,000 | 400 | ~80%
cat / read | 20x | 40,000 | 12,000 | ~70%
grep / rg | 8x | 16,000 | 3,200 | ~80%
git status | 10x | 3,000 | 600 | ~80%
git diff | 5x | 10,000 | 2,500 | ~75%
cargo test / npm test | 5x | 25,000 | 2,500 | ~90%
Total | | ~118,000 | ~23,900 | ~80%

💡 Tip: The table is a directional reference, not a guarantee. Run rtk gain to see real data for your own project.

Four Compression Strategies

RTK applies four targeted strategies per command type:

Smart Filtering — removes noise: strips banners, blank lines, redundant boilerplate, and non-essential metadata. A cargo test run with 262 repeated “ok” entries compresses to a single clean summary line.

Grouping — aggregates similar items: groups files by directory, consolidates identical errors — no more line-by-line repetition.

Truncation — keeps critical skeleton: preserves the essential context structure, trims secondary details. The model gets high-density information, not raw output.

Deduplication — collapses repetitive logs: repetitive log lines (like duplicate stack trace lines) are folded with count summaries.
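
RTK’s internals aren’t shown here, but the deduplication idea can be sketched with standard tools: adjacent duplicate lines collapse into count summaries, much like RTK folds repeated log lines.

```shell
# Collapse adjacent duplicate lines into "<count> <line>" summaries.
# uniq -c does the fold; awk squeezes uniq's column padding:
printf 'ok\nok\nok\nERROR: timeout\nok\nok\n' | uniq -c | awk '{$1=$1; print}'
# → 3 ok
#   1 ERROR: timeout
#   2 ok
```

Six lines become three, and the lone error is no longer buried in repetition.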

Real Before/After Comparison

The official site shows a real cargo test comparison. The raw output is a 262-test verbose log; after RTK compression it becomes:

✓ cargo test: 262 passed (1 suite, 0.08s)

From 5,000+ tokens down to one clean line. That’s the difference between feeding the model raw fodder and high-density information.
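
That summary line is roughly what you would get by counting pass lines yourself. A shell sketch (the log is simulated, and the output format only mimics RTK’s, it is not RTK’s code):

```shell
# Simulate a verbose 262-test log, then boil it down to one summary line:
log=$(seq 1 262 | sed 's/^/test case_/; s/$/ ... ok/')
passed=$(printf '%s\n' "$log" | grep -c ' ok$')
echo "cargo test: $passed passed"
# → cargo test: 262 passed
```

The model only ever needs that last line; everything above it is recoverable noise.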

Quick Start

Installation

# Homebrew (macOS/Linux)
brew install rtk

# One-line install script
curl -fsSL https://rtk-ai.app/install.sh | sh

# Build from source with Cargo
cargo install --git https://github.com/rtk-ai/rtk

# Pre-built binaries for Windows/Linux/macOS
# Visit https://github.com/rtk-ai/rtk/releases

Integrate with AI Coding Tools

# Claude Code / Copilot (default)
rtk init -g

# Gemini CLI
rtk init -g --gemini

# Codex (OpenAI)
rtk init -g --codex

# Cursor
rtk init -g --agent cursor

# Windsurf
rtk init --agent windsurf

# Cline / Kilo Code etc.
rtk init --agent <tool-name>

After installation, restart your AI coding tool for the RTK hook to take effect.

Monitor Savings

# View overall savings
rtk gain

# With graph
rtk gain --graph

# Historical view
rtk gain --history

Privacy & Telemetry

RTK is explicit about data collection: anonymous, aggregate usage statistics exist but are disabled by default. You must explicitly opt in via rtk init or rtk telemetry enable. You can also force-disable them with an environment variable:

RTK_TELEMETRY=off rtk gain

For enterprise adoption, review docs/TELEMETRY.md in the repo and align with your internal security policy.

Key Takeaways

📌 Three things to know before using RTK:

  1. Bash hook dependency: RTK hooks only intercept Bash tool calls. Tools with built-in Read/Glob paths (like Cursor’s) won’t be rewritten automatically; call rtk read, rtk grep, or rtk find explicitly instead.
  2. Tee mechanism as a safety net: RTK’s tee mechanism automatically saves the unfiltered full output on failure, so no critical information is lost even when compression is applied.
  3. Results vary by project: compression rates depend on project size, command frequency, and output patterns. The official numbers are a directional reference; use rtk gain to measure your actual project.
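
Takeaway 2 can be sketched in shell: tee preserves the complete output on disk while only a filtered view flows downstream (illustrative only; the file path and the filter are assumptions, not RTK’s internals):

```shell
# Safety net: keep the full output while passing on a compressed view.
run_filtered() {
  "$@" 2>&1 | tee /tmp/full_output.log | tail -n 2   # downstream sees only the tail
}

run_filtered seq 1 100           # prints: 99 and 100
wc -l < /tmp/full_output.log     # the full log still holds all 100 lines
```

If the compressed view turns out to be missing something critical, the complete output is still there to read back.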

Conclusion

RTK targets an often-overlooked cost center in AI coding: CLI noise. Its engineering is developer-friendly: single binary, broad coverage, multi-tool integration, and rtk gain for real-time quantification.

But manage expectations: it’s not magic. For tool paths that bypass Bash hooks, and for cases where you genuinely need the complete output, compression either doesn’t apply or shouldn’t, since it is lossy by design. It works best as default infrastructure: the model carries less noise, so your tokens buy information density instead of repeated content.

👉 Website: https://www.rtk-ai.app/
👉 GitHub: https://github.com/rtk-ai/rtk