token-guard
SKILL.md
Token Guard ð°
Stop burning money while you sleep.
Your AI agent runs 24/7. Opus costs $75/M output tokens. One overnight coding session can burn $50+ before you wake up. Token Guard watches your spend and acts before your wallet notices.
The Problem
Real costs from real users:
| Scenario | Model | Cost |
|---|---|---|
| "Let the agent run overnight" | Opus | $50-100 |
| "Register an email + X account" | Opus | $55 (one task!) |
| First week exploring | Opus | $100+ |
| Daily heartbeat checks | Opus | $2-5/day for nothing |
Most users don't realize they're burning Opus-level tokens on tasks that Haiku could handle.
What Token Guard Does
Monitor
- Track token usage per day/model
- Estimate costs from model pricing
- Show usage history and trends
Budget
- Set daily spending limits
- Warning alerts at configurable threshold (default 80%)
- Hard limits with auto-model-downgrade
Optimize
- Model cost comparison at a glance
- Routing recommendations (Opus for reasoning, Haiku for background)
- Per-task cost estimation before running
Quick Start
# See current usage and model costs
bash scripts/token-guard.sh status
# Set a $10/day budget with warnings
bash scripts/token-guard.sh set-budget 10
# Set a hard limit: auto-downgrade to Haiku when exceeded
bash scripts/token-guard.sh set-budget 10 80 true claude-haiku-3-5
# Estimate cost before running a task
bash scripts/token-guard.sh estimate claude-opus-4 50000 10000 10
# Check usage history
bash scripts/token-guard.sh history 7
Model Cost Comparison
| Model | Input $/1M | Output $/1M | Relative |
|---|---|---|---|
| Claude Opus 4 | $15.00 | $75.00 | 100x |
| Claude Sonnet 4 | $3.00 | $15.00 | 20x |
| Claude Haiku 3.5 | $0.80 | $4.00 | 5x |
| GPT-4o | $2.50 | $10.00 | 13x |
| Gemini Flash | $0.075 | $0.30 | 1x |
| DeepSeek | $0.27 | $1.10 | 1.5x |
Recommended Routing
Tier 1 (Reasoning/Creative): claude-opus-4 â Use sparingly
Tier 2 (Daily work): claude-sonnet-4 â Primary model
Tier 3 (Background/Subagent): claude-haiku-3-5 â Subagent model
Tier 4 (Bulk/Heartbeat): gemini-2.0-flash â Heartbeat/cron
Savings: Routing background tasks to Haiku instead of Opus = 95% cost reduction on those tasks.
For AI Agents
Add to your heartbeat or cron:
bash /path/to/token-guard/scripts/token-guard.sh monitor
When budget is exceeded with hard limit enabled, the script automatically downgrades the primary model.
Install
clawdhub install token-guard
# or: git clone https://github.com/jzOcb/token-guard
Requirements
bash4+python3curl
Related
- config-guard â Config validation and auto-rollback
- upgrade-guard â Safe upgrades with snapshot and auto-rollback
- agent-guardrails â Code-level enforcement for AI agents
Weekly Installs
14
Repository
aaaaqwq/claude-âĶe-skillsGitHub Stars
11
First Seen
12 days ago
Security Audits
Installed on
opencode13
gemini-cli13
github-copilot13
amp13
cline13
codex13