Skip to content
View YuvrajSingh-mist's full-sized avatar

Sponsors

@abstrait
@pramodith

Block or report YuvrajSingh-mist

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
YuvrajSingh-mist/README.md

Hi 👋, Myself Yuvraj Singh

A passionate AI/ML developer inclined towards NLP and CV (Multimodality). Aspire to pursue research abroad in the same domain

  • 🔭 I love to replicate SOTA papers, especially langauge models and RL ones!

  • 🌱 I’m currently learning about LLM-RL and Quatizations for Edge AI

  • 🤝 I’m looking for RE/RS intern or FTE roles in AI/ML domain

  • 📫 How to reach me [email protected]


🔥 My Stats :

Yuvraj Singh's Streak

[Yuvraj's github activity graph]

Pinned Loading

  1. Paper-Replications Paper-Replications Public

    A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch

    Jupyter Notebook 402 44

  2. NeatRL NeatRL Public

    Repository of implementations of classic and sota rl algorithms from scratch in PyTorch

    Python 221 21

  3. SmolHub SmolHub Public

    A collection of my from-scratch implementation of various models deployed on HF Spaces

    Python 10 1

  4. smolcluster smolcluster Public

    A distributed training and infra library

    Python 23

  5. SmolMixtral SmolMixtral Public

    So, I trained a MoE based a 124M (8x12M) architecture I coded from ground up to build a small instruct model, going through the below-mentioned stages from scratch. Trained on TiyStories dataset fo…

    Python 14 1

  6. SmolWhisper SmolWhisper Public

    Trained a Whisper model a ~30M (whisper tiny.en) architecture I coded from ground up to build a small ASR model, going through the below-mentioned stage from scratch. Trained on GigaSpeech dataset …

    Python 9 1