CS336 Spring 2025 Assignment 5: Alignment

Datasets available

Curated train,val,sft (using gpt 4o api with 383 samples) dataset from MATH

Overview images

sft

RL

Sft experiments

There isn't much to say about sft. Sft is not very sensitive to learning rate, so a relatively reasonable learning rate would lead to good results. Using the full 383-sample sft dataset will produce a checkpoint with validation accuracy of approximately 0.64.

RL experiments

See my wandb report for detailed reports of the ablation studies.

Usage

implementations of functions for unit tests in cs336_alignment/post_training_utils
run sft with uv run cs336_alignment/sft.py
run grpo with uv run cs336_alignment/grpo.py

Link to a repository that helps me a lot

brandon-snider's github repository

Original README content

For a full description of the assignment, see the assignment handout at cs336_spring2025_assignment5_alignment.pdf

We include a supplemental (and completely optional) assignment on safety alignment, instruction tuning, and RLHF at cs336_spring2025_assignment5_supplement_safety_rlhf.pdf

If you see any issues with the assignment handout or code, please feel free to raise a GitHub issue or open a pull request with a fix.

Setup

As in previous assignments, we use uv to manage dependencies.

Install all packages except flash-attn, then all packages (flash-attn is weird)

uv sync --no-install-package flash-attn
uv sync

Run unit tests:

uv run pytest

Initially, all tests should fail with NotImplementedErrors. To connect your implementation to the tests, complete the functions in ./tests/adapters.py.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
cs336_alignment		cs336_alignment
data		data
images		images
scripts		scripts
tests		tests
.gitignore		.gitignore
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
README.md		README.md
convert_math.py		convert_math.py
cs336_spring2025_assignment5_alignment.pdf		cs336_spring2025_assignment5_alignment.pdf
cs336_spring2025_assignment5_supplement_safety_rlhf.pdf		cs336_spring2025_assignment5_supplement_safety_rlhf.pdf
normalize.py		normalize.py
prepare_dataset.py		prepare_dataset.py
pyproject.toml		pyproject.toml
snapshot_download.py		snapshot_download.py
test_and_make_submission.sh		test_and_make_submission.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS336 Spring 2025 Assignment 5: Alignment

Datasets available

Overview images

sft

RL

Sft experiments

RL experiments

Usage

Link to a repository that helps me a lot

Original README content

Setup

About

Uh oh!

Releases

Packages

Languages

llltttwww/cs336-25spring-assignment5-alignment

Folders and files

Latest commit

History

Repository files navigation

CS336 Spring 2025 Assignment 5: Alignment

Datasets available

Overview images

sft

RL

Sft experiments

RL experiments

Usage

Link to a repository that helps me a lot

Original README content

Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages