rmsnorm
Here are 11 public repositories matching this topic...
Efficient kernel for RMS normalization with fused operations; includes both forward and backward passes and is compatible with PyTorch.
Updated Jun 5, 2024 - Python
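For context on what these kernels compute: RMSNorm scales each activation vector by the reciprocal of its root-mean-square and applies a learned per-feature gain. A minimal PyTorch sketch (class and variable names are illustrative, not taken from the repository above):

```python
import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    """Root-mean-square layer normalization: y = x / rms(x) * g."""
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))  # learned per-feature gain

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # RMS over the last (feature) dimension; eps guards against division by zero
        rms = torch.sqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return x / rms * self.weight
```

Unlike LayerNorm there is no mean subtraction and no bias term, which is part of what makes a single fused forward/backward kernel attractive.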
Simple and easy-to-understand PyTorch implementation of Large Language Models (LLMs), GPT and LLaMA, from scratch with detailed steps. Implemented: Byte-Pair Tokenizer, Rotary Positional Embedding (RoPE), SwishGLU, RMSNorm, Mixture of Experts (MoE). Tested on a Taylor Swift song-lyrics dataset.
Updated Nov 18, 2024 - Python
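Of the components listed, rotary positional embeddings admit a compact sketch: pairs of feature channels are rotated by position-dependent angles, so relative position shows up in attention dot products. A hedged illustration (the function name and "split-half" pairing convention are assumptions, not taken from the repository):

```python
import torch

def apply_rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Rotate channel pairs of x (shape ..., seq_len, dim; dim even) by
    position-dependent angles, as in rotary positional embeddings."""
    *_, seq_len, dim = x.shape
    half = dim // 2
    # one frequency per channel pair, geometrically spaced
    freqs = base ** (-torch.arange(half, dtype=x.dtype) / half)
    angles = torch.arange(seq_len, dtype=x.dtype)[:, None] * freqs[None, :]
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., :half], x[..., half:]  # pair channel i with channel i + half
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)
```

Because each pair is a pure rotation, the transform preserves the norm of every token vector, and position 0 (zero angle) is left unchanged.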
A modern minimal LLM implementation made to be easily modified by non-professionals and trained on consumer hardware.
Updated Dec 14, 2025 - Python
Simple character-level Transformer.
Updated May 27, 2024 - Jupyter Notebook
Nano-scale generative models, for fun. No SOTA here, nano first.
Updated Jul 27, 2025 - Jupyter Notebook
A from-scratch PyTorch LLM implementing Sparse Mixture-of-Experts (MoE) with Top-2 gating. Integrates modern Llama-3 components (RMSNorm, SwiGLU, RoPE, GQA) and a custom-coded Byte-Level BPE tokenizer. Pre-trained on a curated corpus of existential and dark philosophical literature.
Updated Dec 1, 2025 - Python
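Top-2 gating, as mentioned above, routes each token to its two highest-scoring experts and renormalizes those two scores into mixing weights. A minimal sketch (function names are illustrative, and the dense combine loop is for clarity; production implementations dispatch tokens to experts sparsely):

```python
import torch
import torch.nn.functional as F

def top2_route(router_logits: torch.Tensor):
    """router_logits: (tokens, n_experts) -> per-token (weights, expert indices)."""
    top_vals, top_idx = router_logits.topk(2, dim=-1)
    weights = F.softmax(top_vals, dim=-1)  # renormalize over the chosen two
    return weights, top_idx

def moe_forward(x, router_logits, experts):
    """x: (tokens, dim); experts: list of callables mapping (m, dim) -> (m, dim)."""
    weights, idx = top2_route(router_logits)
    out = torch.zeros_like(x)
    for k in range(2):  # the two selected experts per token
        for e, expert in enumerate(experts):
            mask = idx[:, k] == e  # tokens whose k-th choice is expert e
            if mask.any():
                out[mask] += weights[mask, k : k + 1] * expert(x[mask])
    return out
```

The softmax over only the selected two logits keeps the combined output a convex mixture of exactly two expert outputs per token.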
Optimized fused RMSNorm implementation in CUDA. Features vectorized memory access (float4), warp-level reductions, and an efficient backward pass for LLM training.
Updated Dec 24, 2025 - Python
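A fused backward kernel like this is typically validated against a hand-derived reference gradient. For y = x · w / rms(x), the input gradient has a closed form; a pure-PyTorch reference sketch (illustrative, not the repository's code) that can be checked against autograd:

```python
import torch

def rmsnorm(x, w, eps=1e-6):
    rms = (x.pow(2).mean(-1, keepdim=True) + eps).sqrt()
    return x / rms * w

def rmsnorm_grad_x(x, w, g, eps=1e-6):
    """Closed-form dL/dx for y = x * w / rms(x), given upstream gradient g."""
    rms = (x.pow(2).mean(-1, keepdim=True) + eps).sqrt()
    wg = w * g
    # product rule: direct term, minus the correction from rms depending on x
    return wg / rms - x * (x * wg).mean(-1, keepdim=True) / rms.pow(3)
```

The per-row mean in the correction term is the reduction a fused kernel computes with warp-level primitives instead of a separate pass.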
🚀 Build your own LLM easily with OpenLabLM, a lightweight, hackable codebase tailored for hobbyists using a single consumer GPU.
Updated Jan 4, 2026 - Python