MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 3 days ago • 28
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published 5 days ago • 34
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper • 2604.05015 • Published 5 days ago • 222
Swift-SVD: Theoretical Optimality Meets Practical Efficiency in Low-Rank LLM Compression Paper • 2604.01609 • Published 9 days ago • 10
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning Paper • 2604.05404 • Published 4 days ago • 38
VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors Paper • 2604.02486 • Published 9 days ago • 9
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published 7 days ago • 30
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 5 days ago • 97
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale Paper • 2604.04771 • Published 5 days ago • 110
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 5 days ago • 196
SkillX: Automatically Constructing Skill Knowledge Bases for Agents Paper • 2604.04804 • Published 5 days ago • 26
FlowSlider: Training-Free Continuous Image Editing via Fidelity-Steering Decomposition Paper • 2604.02088 • Published 9 days ago • 6
UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving Paper • 2604.02190 • Published 9 days ago • 25
GPA: Learning GUI Process Automation from Demonstrations Paper • 2604.01676 • Published 9 days ago • 16
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published 9 days ago • 137
QuitoBench: A High-Quality Open Time Series Forecasting Benchmark Paper • 2603.26017 • Published 15 days ago • 31