Open
Description
Hi,
Thank you for the great and well-organized repo! Is there a way to get a numerical score at each step, instead of just \boxed{correct} vs. \boxed{incorrect}? The Hugging Face page alludes to this:
"Overconfidence: Generative PRMs like ThinkPRM can sometimes produce scores clustered near 0 or 1, potentially not reflecting true uncertainty"
However, I could not find how to access these scores. How can I get them? Thanks!
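To make the question concrete, here is a minimal sketch of the kind of score I have in mind. It assumes the logits of the `correct` and `incorrect` label tokens at each step's verdict position are available; that is my assumption, not something I found documented in the repo. The function name `step_score` and the logit values are hypothetical, used only for illustration.

```python
import math


def step_score(logit_correct: float, logit_incorrect: float) -> float:
    """Two-way softmax over the two label-token logits, returning P(correct)."""
    # Subtract the max before exponentiating for numerical stability.
    m = max(logit_correct, logit_incorrect)
    e_correct = math.exp(logit_correct - m)
    e_incorrect = math.exp(logit_incorrect - m)
    return e_correct / (e_correct + e_incorrect)


# Made-up (correct, incorrect) logits for three steps, not real model output.
example_logits = [(4.0, -1.0), (0.2, 0.0), (-3.0, 2.5)]
scores = [step_score(c, i) for c, i in example_logits]
print([round(s, 3) for s in scores])
```

A score like this would be a continuous value in [0, 1] per step, which is what I would like to read out instead of the hard label.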