Add Automated Circuit Breaker Regression Detection System#943
Open
nirmitparikh8 wants to merge 10 commits intopid-take-2from
Open
Add Automated Circuit Breaker Regression Detection System#943nirmitparikh8 wants to merge 10 commits intopid-take-2from
nirmitparikh8 wants to merge 10 commits intopid-take-2from
Conversation
be0d5c2 to
2d17fc3
Compare
nirmitparikh8
commented
Dec 19, 2025
Contributor
Author
There was a problem hiding this comment.
This is the file to tune configs. Let's iterate on this and make the configs more restrictive. Right now we allow 80 percent of changes in error trend. We should figure out why these violations are so high when we compare new csv to baseline and fix that. Could be something with the test or with hoq we check for regressions (This new system I built)
b730d85 to
2d17fc3
Compare
aeba5a1 to
2231b64
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
🎯 What We're Trying to Achieve
Replace manual visual inspection of experiment results with an automated statistical regression detection system that can:
🔧 How We're Achieving It
Statistical Approach: Percentile-Based Control Charts
|actual_rate - target_rate| / target_rate * 100Automated Pipeline Components
collect_baseline_data.rb: Automated baseline collection (runs experiments N times, organizes results)compute_baselines.rb: Statistical analysis (calculates percentiles from historical data)detect_regressions.rb: Main detection engine (compares current results vs baselines)regression_config.rb: Centralized, tunable configuration📊 Pipeline Flow & Math
🚧 Current State & Progress
✅ What's Complete
🔧 What We're Still Tuning
🤖 Current Configuration (Intentionally Generous)
Why So Generous? I wanted to develop a baseline MVP for the team to iterate on before my internship ends. I didn't get time to go deeper.
📈 Next Steps
collect_baseline_data.rb 15across multiple weeks🎯 Success Metrics
This PR establishes the foundation for data-driven circuit breaker regression detection. The generous initial configuration ensures we don't block development while we gather data to optimize the system.