Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing Paper • 2604.02288 • Published 12 days ago • 30
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 8 days ago • 105
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale Paper • 2604.04771 • Published 8 days ago • 115
VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification Paper • 2604.01569 • Published 12 days ago • 12
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published 13 days ago • 13
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published 15 days ago • 85 • 4
Emergent Social Intelligence Risks in Generative Multi-Agent Systems Paper • 2603.27771 • Published 16 days ago • 51
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published 17 days ago • 142
EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models Paper • 2603.12252 • Published Mar 12 • 12
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Paper • 2603.09206 • Published Mar 10 • 53
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published Mar 5 • 37
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline Paper • 2603.05484 • Published Mar 5 • 4
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published Mar 3 • 103
AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games Paper • 2602.17594 • Published Feb 19 • 9