RoboGround: Robotic Manipulation with Grounded Vision-Language Priors Paper • 2504.21530 • Published Apr 30, 2025
CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation Paper • 2506.19816 • Published Jun 24, 2025
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation Paper • 2506.10966 • Published Jun 12, 2025
MM-ACT: Learn from Multimodal Parallel Generation to Act Paper • 2512.00975 • Published Nov 30, 2025 • 6
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy Paper • 2510.13778 • Published Oct 15, 2025 • 16