Changelog¶
All notable changes to llm-toll are documented here.
v0.13.0¶
- Round all costs to 10 decimal places (
COST_ROUND_PLACES) to prevent floating-point drift - Web dashboard for cost trends and usage analytics
v0.12.0¶
- PostgreSQL backend for team-wide cost tracking
PostgresStorewithThreadedConnectionPooland row-level lockingcreate_store()factory function with URL-based backend selectionLLM_TOLL_STORE_URLenvironment variable support- Graceful fallback to SQLite on connection failure
v0.11.0¶
- Auto-updating pricing registry from remote source
update_pricing()function with local caching (24h TTL)--update-pricingCLI commandload_remote_pricing()for bulk pricing updates
v0.10.0¶
- LangChain callback integration (
LangChainCallback) - Pre-call budget check in
on_llm_start - Post-call usage logging in
on_llm_end
v0.9.0¶
- LiteLLM callback integration (
LiteLLMCallback) - Model name normalization for LiteLLM provider prefixes
- Zero-decorator cost tracking for all LiteLLM calls
v0.8.0¶
- Hardening batch 3: TPM retry_after calculation, prefix boundary matching, bounded dynamic cache
- Protected user-registered models from cache eviction
- Fixed prefix resolution race condition
v0.7.0¶
- Hardening batch 2: TOCTOU documentation, mid-stream break test, coverage CI
- Hardening batch 1: version single source of truth, context manager support, pricing validation
v0.6.0¶
- CLI dashboard for viewing costs and usage statistics
--stats,--reset,--export csvcommands--projectand--modelfilters- Color-coded cost output with
NO_COLORsupport
v0.5.0¶
- Async decorator support for async functions and async generators
asyncio.to_threadfor non-blocking SQLite operations- Transparent async stream wrapping
v0.4.0¶
- Gemini SDK auto-parser
- Local/Ollama provider support with $0 cost tracking
- Rate limiting for local models
v0.3.0¶
RateLimiterwith sliding-window RPM and TPM enforcementLocalRateLimitErrorwithretry_afterattribute- Pre-call rate limit checks in the decorator
v0.2.0¶
- Streaming support for OpenAI and Anthropic
StreamAccumulatorfor chunk-based token tracking- Character-based token estimation fallback
v0.1.0¶
- Initial release
@track_costsdecorator with auto-detection for OpenAI and AnthropicPricingRegistrywith built-in model pricingSQLiteStorefor local persistenceCostReporterwith color-coded terminal outputBudgetExceededErrorfor budget enforcement- GitHub Actions release pipeline