[Draft]feat(recipe): add NVFP4 QAT recipe for Qwen3-30B W4A16 by zhangyimi · Pull Request #36 · verl-project/verl-recipe

zhangyimi · 2026-02-03T17:45:45Z

Description

This PR adds support for NVFP4 Quantization-Aware Training (QAT) with FSDP, enabling W4A16 (weight-only) quantization during RL training.

What's included

verl/utils/qat/ module: QATLinear (Triton FP4 fake quantization), scale fusion, NVFP4 quantizer, and vLLM dynamic weight loading patches
Recipe scripts and configs for Qwen3-30B-A3B W4A16 (full quantization & FFN-only quantization)
Detailed README with implementation overview and experimental results

Key Results

Validated on Qwen3-8B-Base (Dense) and Qwen3-30B-A3B-Base (MoE): W4A16 QAT achieves training accuracy on par with BF16 baseline, while without QAT the KL divergence explodes and training crashes.
70.3% weight memory reduction on Qwen3-30B-A3B during rollout (56.88 GiB → 16.89 GiB), freeing ~40 GiB for additional KV Cache capacity.

VeRL PR: verl-project/verl#5190
README: https://github.com/zhangyimi/verl-recipe/blob/006aa5dabb8dac1f2369e52c3ad27455b84e7799/qat/README.md

zhangyimi mentioned this pull request Feb 3, 2026

[Draft]feat: add NVFP4 QAT (Quantization-Aware Training) support for verl FS… verl-project/verl#5190

Draft

8 tasks

zhangyimi force-pushed the qat branch from 7ef648e to af25908 Compare February 4, 2026 02:42

feat(recipe): add NVFP4 QAT recipe for Qwen3-30B W4A16

fed9d68

zhangyimi force-pushed the qat branch from af25908 to fed9d68 Compare February 4, 2026 08:30

root and others added 5 commits February 5, 2026 00:17

fix error and update readme

77ccff5

update regex

53dce13

add config

95cd758

update README

dfbf09c

update readme

006aa5d

zhangyimi force-pushed the qat branch from 55ec048 to 006aa5d Compare February 6, 2026 09:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Draft]feat(recipe): add NVFP4 QAT recipe for Qwen3-30B W4A16#36

[Draft]feat(recipe): add NVFP4 QAT recipe for Qwen3-30B W4A16#36
zhangyimi wants to merge 6 commits intoverl-project:mainfrom
zhangyimi:qat

zhangyimi commented Feb 3, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

zhangyimi commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

What's included

Key Results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

zhangyimi commented Feb 3, 2026 •

edited

Loading