-
Notifications
You must be signed in to change notification settings - Fork 227
Open
Description
The problem is in the function run_grpo_microbatch_train_step.
Its docstring implies that the function should expect constant_normalize_factor as an input argument(which is very convenient for ablation studies on length normalization):
assignment5-alignment/tests/adapters.py
Line 237 in a016aff
| constant_normalize_factor: int | None, provided if we want to sum over |
And yet there is no such argument in the function definition😢
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels