A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
ByteDance Papers Collection • 134 items • Updated about 12 hours ago • 22
barc0/200k_HEAVY_gpt4o-description-gpt4omini-code_generated_problems • Updated Nov 2, 2024 • 139k • 119 • 11
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12, 2025 • 74
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published Feb 13, 2025 • 37