From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation Paper • 2510.07624 • Published Oct 8, 2025 • 7
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper • 2509.24372 • Published Sep 29, 2025 • 9
view article Article 🌁#85: Curiosity, Open Source, and Timing: The Formula Behind DeepSeek’s Phenomenal Success Jan 27, 2025 • 6
Evolution and The Knightian Blindspot of Machine Learning Paper • 2501.13075 • Published Jan 22, 2025 • 6