Blogs, papers, and articles I have found useful and enjoyed.
- Speeding up RL with high-leverage samples · March 2026
- Composer 2 Technical Report · March 2026
- State of RL for reasoning LLMs · March 2026
- Recursive Language Models · October 2025
- Defeating Nondeterminism in LLM Inference · September 2025