Broken error messages in token normalization—`{token!r}` shown literally instead of actual token value (#579) by markknoffler · Pull Request #580 · google-deepmind/gemma

markknoffler · 2026-02-19T17:08:21Z

Fix for Issue #579

Root cause

In _normalize_token, when a stop_token or forbidden_token string maps to multiple token IDs, the code raises a ValueError using a plain string instead of an f-string:

raise ValueError(
    'Invalid token: {token!r}. `stop_token`s and `forbidden_token`s must'
    ' map to single token ids in the vocab.'
)

Because there is no f prefix, Python treats {token!r} as literal text. Users see:

ValueError: Invalid token: {token!r}. `stop_token`s and `forbidden_token`s must map to single token ids in the vocab.

instead of the actual invalid token value, making debugging harder.

Fix summary

Use an f-string so {token!r} is interpolated and the real token value appears in the error message. Apply the same change in both affected files.

Patch sketch

1. gemma/gm/text/_sampler.py

raise ValueError(
    f'Invalid token: {token!r}. `stop_token`s and `forbidden_token`s must'
    ' map to single token ids in the vocab.'
)

2. gemma/research/t5gemma/sampling.py

raise ValueError(
    f'Invalid token: {token!r}. `stop_token`s and `forbidden_token`s must'
    ' map to single token ids in the vocab.'
)

fix: resolve fstring token bug

c57cafa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Broken error messages in token normalization—`{token!r}` shown literally instead of actual token value (#579)#580

Broken error messages in token normalization—`{token!r}` shown literally instead of actual token value (#579)#580
markknoffler wants to merge 1 commit intogoogle-deepmind:mainfrom
markknoffler:fix/fix-fstring-token-bug

markknoffler commented Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

markknoffler commented Feb 19, 2026

Fix for Issue #579

Root cause

Fix summary

Patch sketch

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant