A compilation of resources for keeping up with the latest trends in NLP.
Note: This resource list is a work in progress. More papers and topics will be added regularly. Contributions and suggestions are welcome!
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- GPT1
- GPT2
- T5
- XLNet
- RoBERTa
- ALBERT
- LongFormer
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
DPO:
PPO:
- Proximal Policy Optimization Algorithms
- PPO Docs OpenAI
- Understanding PPO from First Principles Blog
GRPO:
- Basic Mech Interp Essay
- Toy Neural Nets with low dimensional inputs
- Mechanistic Interpretability for AI Safety Review
- A Mathematical Framework for Transformer Circuits
- Circuit Tracing: Revealing Computational Graphs in Language Models