Yolo Y. Tang

yunlong10

https://yunlong10.github.io/

AI & ML interests

LMMs/Agents for Video Understanding

Organizations

None yet

Recent Activity

upvoted a paper about 2 months ago

Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination

Paper • 2511.17490 • Published Nov 21, 2025 • 21

upvoted a paper 2 months ago

Latent Chain-of-Thought for Visual Reasoning

Paper • 2510.23925 • Published Oct 27, 2025 • 9

upvoted 3 papers 3 months ago

Directional Reasoning Injection for Fine-Tuning MLLMs

Paper • 2510.15050 • Published Oct 16, 2025 • 11

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10, 2025 • 26

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

Paper • 2510.05034 • Published Oct 6, 2025 • 49

upvoted a paper 4 months ago

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published Sep 1, 2025 • 37

upvoted a paper 5 months ago

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing

Paper • 2508.10881 • Published Aug 14, 2025 • 52

upvoted a paper 8 months ago

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

Paper • 2505.20426 • Published May 26, 2025 • 7

upvoted 4 papers 9 months ago

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published Apr 7, 2025 • 15

upvoted a paper 10 months ago

VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity

Paper • 2503.11557 • Published Mar 14, 2025 • 22

upvoted a paper 12 months ago

Generative AI for Cel-Animation: A Survey

Paper • 2501.06250 • Published Jan 8, 2025 • 12

upvoted a paper about 1 year ago

MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models

Paper • 2410.09733 • Published Oct 13, 2024 • 8

upvoted a paper over 2 years ago

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83
