
Conversation

@Cnp11784

Making a test run to see what happens.

liyunlu0618 and others added 30 commits October 24, 2024 15:22
* Add a symlink for llama_models

* Update README.md paths
Updated model card to remove erroneous heading markdown in Hardware and Software section
Model should work with "raw" bytes, never URLs (meta-llama#244)

* Model should work with "raw" bytes, never URLs

Modeling code, or code close to it (chat_format.py specifically), should not be concerned with downloading
URLs, and especially not doing so randomly on demand.

* Use ModelInputMessage / ModelOutputMessage and BytesIO

* Fixes

* Fold everything into a much simpler RawMessage type, update prompt_format
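For context, the end state those commits describe looks roughly like this; a minimal sketch assuming dataclass-style types, with field names that are illustrative rather than the repository's exact definitions:

```python
# Sketch of the "raw bytes, never URLs" design; field names are
# illustrative, not the repository's exact definitions.
from dataclasses import dataclass
from io import BytesIO


@dataclass
class RawMediaItem:
    # Media is held as in-memory bytes; callers fetch any URL
    # *before* constructing the message.
    data: BytesIO


@dataclass
class RawMessage:
    role: str                      # "user", "assistant", "system", ...
    content: str | RawMediaItem    # plain text or an already-loaded media item


def load_image(path: str) -> RawMediaItem:
    # I/O happens at the call site, never inside chat_format.py.
    with open(path, "rb") as f:
        return RawMediaItem(data=BytesIO(f.read()))
```

The point of the design is that the chat-formatting layer only ever sees bytes the caller has already loaded.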
… (meta-llama#255)

* Test PR Submission - Write Permission

* Update EFS volume structuring

* Test cron scheduling

* Schedule cron job and update branch inputs / actions versioning

* Remove previous temporary unnecessary commit lines
… (meta-llama#256)

* models nightly

* publish

* environment

* schedule

* test manual trigger

* fix

* name

* test

* test manual input

* move back to workflow

* dev

* dev

* workflow_dispatch
raghotham and others added 22 commits April 5, 2025 12:23
* refactor: make llama3 and llama4 generation closer to each other

* llama3 script fixes

* fixes

* add llama3 quant

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* resurrect xpu codepath

* update readme
Fixes: b12e462 ("refactor: make llama3 generation closer to llama4 (meta-llama#309)")

Signed-off-by: Dmitry Rogozhkin <[email protected]>
Verified with Llama3.2-3B-Instruct on Intel Data Center Max Series GPU (PVC):
```
torchrun --nproc-per-node=1 models/llama3/scripts/completion.py \
 "$CHECKPOINT_DIR" --world_size 1

torchrun --nproc-per-node=1 models/llama3/scripts/chat_completion.py \
 "$CHECKPOINT_DIR" --world_size 1
```

Signed-off-by: Dmitry Rogozhkin <[email protected]>
The PyTorch xccl distributed backend is available starting from 2.7 (it requires a manual build of
PyTorch with `USE_C10D_XCCL=1 USE_XCCL=1`) and is targeted for inclusion in binary builds
starting from 2.8.

This patch improves support for the Llama3 vision model on XPU devices and was tested
with `Llama3.2-11B-Vision-Instruct`.

Signed-off-by: Dmitry Rogozhkin <[email protected]>
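Backend selection then becomes a one-line choice at startup. A minimal sketch, hedged: the `"xccl"` backend string and the `torch.xpu` calls assume the PyTorch 2.7+ build described above:

```python
# Minimal sketch of distributed-backend selection for Intel GPUs; assumes a
# PyTorch >= 2.7 build compiled with USE_C10D_XCCL=1 USE_XCCL=1
# (not expected in binary wheels until 2.8).
import os

import torch
import torch.distributed as dist

local_rank = int(os.environ.get("LOCAL_RANK", 0))  # set by torchrun

if torch.xpu.is_available():
    backend = "xccl"                 # Intel GPU collective backend
    torch.xpu.set_device(local_rank)
else:
    backend = "nccl" if torch.cuda.is_available() else "gloo"

# Rank and world size are read from the environment torchrun provides.
dist.init_process_group(backend=backend)
```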
* fix: update rope scaling for Llama-4-Scout

* update

* no defaults

* fix
```
PYTHONPATH=$(git rev-parse --show-toplevel) \
torchrun --nproc_per_node=1 \
-m models.llama4.scripts.chat_completion ../checkpoints/Llama-4-Scout-17B-16E-Instruct \
--world_size 1 \
--quantization-mode int4_mixed
```

Before this PR:
```
[rank1]: TypeError: quantize_int4() got multiple values for argument 'output_device'
```
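That error is a standard Python failure mode: a value reaches a parameter both positionally and by keyword. A tiny, self-contained illustration with a hypothetical signature, not the repository's actual `quantize_int4`:

```python
# Hypothetical reproduction of this TypeError class; the real
# quantize_int4() signature in the repo may differ.
def quantize_int4(weight, output_device="cpu"):
    return weight, output_device

try:
    # A positional arg already fills the output_device slot, then the
    # keyword fills it again -> "got multiple values for argument".
    quantize_int4("w", "cuda:0", output_device="xpu")
except TypeError as e:
    print(e)  # quantize_int4() got multiple values for argument 'output_device'
```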
# What does this PR do?


## Test Plan
Making a test run to see what happens.
@facebook-github-bot

Hi @Cnp11784!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g., your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

@Cnp11784 changed the base branch from main to torchao on September 24, 2025 08:42
