AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play Paper • 2509.24193 • Published Sep 29 • 6