Hi,
I’m following along with the course materials as a non-Stanford participant. When I run the Qwen 2.5 Math 1.5B model locally via Hugging Face, my greedy decoding outputs differ slightly from the ones the tests expect.
Could you confirm:
Are the model weights used in the course identical to the Hugging Face Qwen/Qwen2.5-Math-1.5B release?
Or is there a Stanford-hosted version or checkpoint with fine-tuning or other changes applied?
Thanks in advance!