feat: Major features v1.3.0 - Cost Tracking, Caching, Multi-User, A/B Testing #7

murataslan1 · 2025-12-19T02:59:09Z

Summary

This PR adds major new features to ccproxy v1.3.0, including cost tracking, request caching, multi-user support, and A/B testing framework. All features are fully tested with 356 tests passing.

New Modules

Module	Description
`metrics.py`	Cost tracking with budget alerts
`cache.py`	LRU request cache with TTL
`users.py`	Multi-user token/cost limits
`ab_testing.py`	Model comparison framework

Bug Fixes

OAuth graceful fallback when credentials missing
Router initialization race condition
Request metadata memory leak (TTL + max size)
Model reload thrashing with cooldown

Enhancements

ccproxy status --health for health metrics
ccproxy shell-integration for shell aliases
OAuth token background refresh
Configuration validation at startup
Global tokenizer cache for performance
Request retry with exponential backoff

Documentation

docs/troubleshooting.md - Common issues
docs/architecture.md - System design
docs/examples.md - Configuration examples
Updated README.md with all new features

Tests

356 tests passing
71% code coverage
New test files for all new modules

## New Features ### Core Modules Added - **Cost Tracking** (metrics.py): Per-request cost calculation, budget limits, alerts - **Request Caching** (cache.py): LRU cache with TTL, invalidation strategies - **Multi-User Support** (users.py): Per-user token/cost limits, model access control - **A/B Testing** (ab_testing.py): Model comparison with statistical analysis ### Bug Fixes - OAuth graceful fallback when credentials missing - Router initialization race condition fixed - Request metadata store memory leak fixed (TTL + max size) - Model reload thrashing fixed with cooldown - Default config now works out of the box ### Enhancements - Health check endpoint: ccproxy status --health - Shell integration: ccproxy shell-integration --shell [bash|zsh|fish] - OAuth token background refresh - Configuration validation at startup - Global tokenizer cache for performance - Request retry with exponential backoff ### Documentation - docs/troubleshooting.md: Common issues and solutions - docs/architecture.md: System design with ASCII diagrams - docs/examples.md: Configuration examples for various use cases - README.md updated with all new features ### Tests - 356 tests passing - New test files: test_metrics, test_cache, test_users, test_ab_testing, etc. - Coverage improved to 71%

starbaser · 2025-12-20T03:20:47Z

Before reviewing 6,000 lines: can you write up, in your own words, what problem you're solving and what value this adds?
For the bug fixes, please create separate issues with reproduction steps. Then open PRs referencing those issues.

murataslan1 · 2025-12-21T18:24:42Z

Hey,

You're right, 6k lines was way too much. My bad.

Been thinking about this - wanna break it into small PRs instead. Each one solving a specific thing.

First one would be config validation with helpful errors. Fixes that "default config doesn't work" issue. Pretty small change, maybe couple hundred lines, no breaking stuff. Basically when someone messes up the YAML they get an actual helpful message instead of a stack trace.

After that got some other stuff lined up (oauth race condition, connection pooling etc) but lets start small.

Want me to go ahead with the config one?

starbaser · 2026-01-06T18:13:40Z

Happy new year! actually yea config validation would be great, but the "default" behavior is what, normal claude code usage? users interacting through their llm harness may not be aware of adding ccproxy as a dep, add the alias from the readme and their harness stops working.

However, a config assist would be too useful to pass up for what's coming in the pre-release I'm going post today, it's all on the dev branch already. It has some important new features, including some that I would only talk about off-platform besides linking this Also, sorry, it likely will break your in-progress version.

murataslan1 · 2026-01-08T12:52:14Z

Hey @starbaser, thanks for the heads up! I've switched to the dev branch as suggested to avoid conflicts with the new features.

I've implemented the config validation logic we discussed. It now catches
yaml syntax errors and Pydantic validation errors, displaying them in a nice, readable red panel using rich (instead of crashing with a stack trace). I've verified it against valid and invalid configs.

I've pushed the changes to my fork. Should I open a new PR for this feature (feat/config-validation) targeting dev, or would you prefer to handle it differently?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Major features v1.3.0 - Cost Tracking, Caching, Multi-User, A/B Testing #7

feat: Major features v1.3.0 - Cost Tracking, Caching, Multi-User, A/B Testing #7

Uh oh!

murataslan1 commented Dec 19, 2025

Uh oh!

starbaser commented Dec 20, 2025

Uh oh!

murataslan1 commented Dec 21, 2025

Uh oh!

starbaser commented Jan 6, 2026 •

edited

Loading

Uh oh!

murataslan1 commented Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Major features v1.3.0 - Cost Tracking, Caching, Multi-User, A/B Testing #7

Are you sure you want to change the base?

feat: Major features v1.3.0 - Cost Tracking, Caching, Multi-User, A/B Testing #7

Uh oh!

Conversation

murataslan1 commented Dec 19, 2025

Summary

New Modules

Bug Fixes

Enhancements

Documentation

Tests

Uh oh!

starbaser commented Dec 20, 2025

Uh oh!

murataslan1 commented Dec 21, 2025

Uh oh!

starbaser commented Jan 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

murataslan1 commented Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

starbaser commented Jan 6, 2026 •

edited

Loading