Gallery
About
I built this after hitting the same wall repeatedly — no good way to enforce token budgets in application code. Provider caps are account-level and tell you what happened, not what is happening.Two ways to add it: # Direct client wrapper client = tokencap.wrap(anthropic.Anthropic(), limit=50_000) # LangChain, CrewAI, AutoGen, etc. tokencap.patch(limit=50_000) Four actions at configurable thresholds: WARN, DEGRADE (transparent model swap), BLOCK, and WEBHOOK. SQLite out of the box, Redis for multi-agent setups.One design decision worth mentioning: tokencap tracks tokens, not dollars. Token counts come directly from the provider response and never drift with pricing changes.Happy to answer any questions.
Comments (0)
No comments yet. Be the first to comment!
Related Products
OpenBrief – Local-first video downloader/summarizer
Nerve – self hosted runtime for AI agents
skills-for-humanity – 171 structured reasoning skills for Claude Code
skills-for-humanity – 171 structured reasoning skills for Claude Code
OpenBrief – Local-first video downloader/summarizer
Bae – AI companion built around persistent memory architecture