Nvidia Blackwell Support #7
2 issues found across 7 files
Prompt for AI agents (all 2 issues)
Check if these issues are valid — if so, understand the root cause of each and fix them.
<file name="BENCHMARKS.md">
<violation number="1" location="BENCHMARKS.md:50">
P2: Incorrect import pattern in code example. The L0 API exports `run` directly from the module, so `from l0 import l0` will fail. Use `import l0` instead, and replace `json_rule()` with an actual exported guardrail like `l0.JSON_ONLY_GUARDRAILS` or `l0.Guardrails.recommended()`.</violation>
</file>
<file name="tests/test_benchmark.py">
<violation number="1" location="tests/test_benchmark.py:291">
P2: Inconsistent TTFT measurement: `start_time` is set after `_internal_run` completes, but in `run_baseline_benchmark` it's set before iteration. This makes time-to-first-token comparisons between baseline and L0 benchmarks invalid. Consider moving `start_time` before the `_internal_run` call to be consistent.</violation>
</file>
BENCHMARKS.md
Outdated
```
Configure via `check_intervals`:

from l0 import l0
```
P2: Incorrect import pattern in code example. The L0 API exports run directly from the module, so from l0 import l0 will fail. Use import l0 instead, and replace json_rule() with an actual exported guardrail like l0.JSON_ONLY_GUARDRAILS or l0.Guardrails.recommended().
Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At BENCHMARKS.md, line 50:
<comment>Incorrect import pattern in code example. The L0 API exports `run` directly from the module, so `from l0 import l0` will fail. Use `import l0` instead, and replace `json_rule()` with an actual exported guardrail like `l0.JSON_ONLY_GUARDRAILS` or `l0.Guardrails.recommended()`.</comment>
<file context>
@@ -0,0 +1,97 @@
+
+Configure via `check_intervals`:
+```python
+from l0 import l0
+
+result = await l0.run(
</file context>
✅ Addressed in 768c1a8
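The failure mode behind this comment can be reproduced without the real package. The sketch below registers a stub `l0` module that, like the API described in the review, exports `run` at module level; the stub and its names are illustrative, not the repo's actual code.

```python
import sys
import types

# Stub module standing in for the real `l0` package, which (per the
# review) exports `run` directly at module level.
l0_stub = types.ModuleType("l0")

async def _run(**kwargs):  # stand-in for the real l0.run
    return kwargs

l0_stub.run = _run
sys.modules["l0"] = l0_stub

import l0  # correct pattern: the module itself exposes `run`

try:
    from l0 import l0 as _inner  # the pattern the review flags
    import_failed = False
except ImportError:
    # No `l0` attribute or submodule exists inside the module, so the
    # documented example would crash at import time.
    import_failed = True

print(callable(l0.run), import_failed)  # → True True
```

This is why `import l0` is the safe form for the docs: it works regardless of which names the package re-exports internally.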
tests/test_benchmark.py
Outdated
```python
    check_intervals=check_intervals,
)

start_time = time.perf_counter()
```
P2: Inconsistent TTFT measurement: start_time is set after _internal_run completes, but in run_baseline_benchmark it's set before iteration. This makes time-to-first-token comparisons between baseline and L0 benchmarks invalid. Consider moving start_time before the _internal_run call to be consistent.
Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At tests/test_benchmark.py, line 291:
<comment>Inconsistent TTFT measurement: `start_time` is set after `_internal_run` completes, but in `run_baseline_benchmark` it's set before iteration. This makes time-to-first-token comparisons between baseline and L0 benchmarks invalid. Consider moving `start_time` before the `_internal_run` call to be consistent.</comment>
<file context>
@@ -0,0 +1,1044 @@
+ check_intervals=check_intervals,
+ )
+
+ start_time = time.perf_counter()
+
+ async for event in result:
</file context>
✅ Addressed in 9d74796
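The measurement skew described here can be shown in isolation. In this hedged sketch, `asyncio.sleep` stands in for both `_internal_run` and the token stream; the names are illustrative, not the benchmark's actual API. The point is that `start_time` must be taken before the setup call so baseline and L0 runs time the same interval.

```python
import asyncio
import time

async def _token_stream(n=3, delay=0.01):
    # Stand-in for the model's token stream.
    for i in range(n):
        await asyncio.sleep(delay)
        yield f"tok{i}"

async def measure_ttft(setup_delay=0.02):
    start_time = time.perf_counter()   # moved BEFORE the setup call
    await asyncio.sleep(setup_delay)   # stands in for `_internal_run(...)`
    ttft = None
    async for _ in _token_stream():
        if ttft is None:
            # First token observed: TTFT now includes setup time,
            # matching how the baseline benchmark measures it.
            ttft = time.perf_counter() - start_time
    return ttft

ttft = asyncio.run(measure_ttft())
print(ttft >= 0.02)  # → True: setup latency is counted in TTFT
```

Taking `start_time` after setup instead would silently subtract the setup cost from only one side of the comparison, which is exactly the inconsistency the review flags.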
Summary by cubic
Optimized streaming guardrails and drift detection for Nvidia Blackwell–class throughput and added a full benchmark suite and docs. L0 now sustains 90K+ tokens/s with full features, providing ample headroom for 1000+ t/s models.
Performance
Benchmarks
Written for commit dcb1a47. Summary will update automatically on new commits.