Skip to content

Commit 76a7cce

Browse files
committed
replace offload with smaller model
1 parent 35e1ad6 commit 76a7cce

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

.github/workflows/ci_gpu.yml

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -131,7 +131,9 @@ jobs:
131131
run: python dev/data/tinyshakespeare.py --model_desc llama-3
132132

133133
- name: Train model
134-
run: python train_llama3.py --write_tensors 1 --dtype float32 --offload 1
134+
# use the first 10 layers, so that everything fits into the 20GB of
135+
# the A4000 Ada that we have in CI
136+
run: python train_llama3.py --write_tensors 1 --dtype float32 --depth 10
135137

136138
- name: Build FP32 precision
137139
run: PRECISION=FP32 make test_llama3cu

0 commit comments

Comments
 (0)