Emergent Abilities of Large Language Models under Continued Pretraining for Language Adaptation Paper • 2506.00288 • Published May 30, 2025 • 1
SimpleRL-Zoo Collection The collection for the paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 13 items • Updated May 5, 2025 • 8
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 287
WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging Paper • 2502.18316 • Published Feb 25, 2025 • 2