Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching • Paper • 2503.05179 • Published Mar 7, 2025 • 46
SafeArena: Evaluating the Safety of Autonomous Web Agents • Paper • 2503.04957 • Published Mar 6, 2025 • 21
Learning from Failures in Multi-Attempt Reinforcement Learning • Paper • 2503.04808 • Published Mar 4, 2025 • 18
LLM as a Broken Telephone: Iterative Generation Distorts Information • Paper • 2502.20258 • Published Feb 27, 2025 • 27
How to Steer LLM Latents for Hallucination Detection? • Paper • 2503.01917 • Published Mar 1, 2025 • 11
Identifying Sensitive Weights via Post-quantization Integral • Paper • 2503.01901 • Published Feb 28, 2025 • 8
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs • Paper • 2506.14245 • Published Jun 17, 2025 • 45