1 parent 35e1ad6 commit 76a7cce
.github/workflows/ci_gpu.yml
@@ -131,7 +131,9 @@ jobs:
         run: python dev/data/tinyshakespeare.py --model_desc llama-3
 
       - name: Train model
-        run: python train_llama3.py --write_tensors 1 --dtype float32 --offload 1
+        # use the first 10 layers, so that everything fits into the 20GB of
+        # the A4000 Ada that we have in CI
+        run: python train_llama3.py --write_tensors 1 --dtype float32 --depth 10
 
       - name: Build FP32 precision
         run: PRECISION=FP32 make test_llama3cu