Skip to content

Conversation

@mohit-sarvam
Copy link

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

  • Add specific line by line info of high level changes in this PR.

GitHub Actions CI

See the CI sectionin the Contributing doc for how to trigger the CI. A Nvidia developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

  • Related to # (issue)

@copy-pr-bot
Copy link

copy-pr-bot bot commented Dec 28, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@mohit-sarvam mohit-sarvam marked this pull request as draft December 28, 2025 11:34
@mohit-sarvam mohit-sarvam changed the title Sarvam MoE support for 30B Sarvam MoE support for 30B and 100B Dec 28, 2025
@mohit-sarvam mohit-sarvam changed the title Sarvam MoE support for 30B and 100B Sarvam MoE support Dec 28, 2025
@yaoyu-33
Copy link
Contributor

@mohit-sarvam thanks for contribution. Please let us know if you need help.

@cuichenx
Copy link
Contributor

cuichenx commented Jan 2, 2026

Thanks for the contribution. Could you add unit and functional tests for this model? You can follow the examples here

@mohit-sarvam
Copy link
Author

@uppalutkarsh

@mohit-sarvam mohit-sarvam marked this pull request as ready for review January 8, 2026 14:09
@mohit-sarvam mohit-sarvam marked this pull request as draft January 8, 2026 14:09
@mohit-sarvam
Copy link
Author

We will add functional tests once the model is open sourced.

Signed-off-by: mohit-sarvam <[email protected]>
Signed-off-by: mohit-sarvam <[email protected]>
Signed-off-by: mohit-sarvam <[email protected]>
Signed-off-by: mohit-sarvam <[email protected]>
Signed-off-by: mohit-sarvam <[email protected]>
@yaoyu-33
Copy link
Contributor

thanks @mohit-sarvam

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants