VINO: A Unified Visual Generator with Interleaved OmniModal Context • Paper • 2601.02358 • Published 4 days ago • 27
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation • Paper • 2512.24271 • Published 10 days ago • 49
Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking • Paper • 2512.24297 • Published 10 days ago • 5
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement • Paper • 2512.21185 • Published 16 days ago • 28
Yume-1.5: A Text-Controlled Interactive World Generation Model • Paper • 2512.22096 • Published 14 days ago • 57
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture • Paper • 2512.21675 • Published 15 days ago • 24
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding • Paper • 2512.19693 • Published 18 days ago • 62
MedSAM3: Delving into Segment Anything with Medical Concepts • Paper • 2511.19046 • Published Nov 24, 2025 • 49