Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes Paper • 2603.25562 • Published 3 days ago • 4 • 2
Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math Paper • 2603.24961 • Published 3 days ago • 1 • 2
Extending Precipitation Nowcasting Horizons via Spectral Fusion of Radar Observations and Foundation Model Priors Paper • 2603.21768 • Published 6 days ago • 2
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data Paper • 2603.25319 • Published 3 days ago • 26 • 2
Vega: Learning to Drive with Natural Language Instructions Paper • 2603.25741 • Published 3 days ago • 4 • 2
PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders Paper • 2603.25398 • Published 3 days ago • 1 • 2
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks Paper • 2603.24755 • Published 4 days ago • 22 • 2
FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol Paper • 2603.24943 • Published 3 days ago • 5 • 2
Electrostatic Photoluminescence Tuning in All-Solid-State Perovskite Transistors Paper • 2603.25718 • Published 3 days ago • 1 • 2
Nudging Hidden States: Training-Free Model Steering for Chain-of-Thought Reasoning in Large Audio-Language Models Paper • 2603.14636 • Published 13 days ago • 1 • 2
AVO: Agentic Variation Operators for Autonomous Evolutionary Search Paper • 2603.24517 • Published 4 days ago • 5 • 2
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 3 days ago • 104 • 6
VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models Paper • 2603.24575 • Published 4 days ago • 11 • 2
WAFT-Stereo: Warping-Alone Field Transforms for Stereo Matching Paper • 2603.24836 • Published 3 days ago • 1 • 2