Non-Stanford (following along): Is the Qwen 2.5 Math 1.5B different to HF?

Hi,

I’m following along with the course materials as a non-Stanford participant. I noticed that when I run the Qwen 2.5 Math 1.5B model locally via Hugging Face, I get slightly different greedy decoding outputs compared to the tests.
Could you confirm:

Are the model weights used in the course identical to the Hugging Face Qwen/Qwen2.5-Math-1.5B release?

Or is there a Stanford-hosted version / checkpoint with any fine-tuning or other changes?

Thanks in advance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-Stanford (following along): Is the Qwen 2.5 Math 1.5B different to HF? #3

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Non-Stanford (following along): Is the Qwen 2.5 Math 1.5B different to HF? #3

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions