Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning Paper • 2512.15687 • Published 10 days ago • 17
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation Paper • 2509.15194 • Published Sep 18 • 33
LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs Paper • 2410.14182 • Published Oct 18, 2024 • 1