CHO WOOHYUN

Noename

k1064190

AI & ML interests

Stable Diffusion, TTS, VC

Organizations

None yet

upvoted 3 papers 3 months ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7 • 141

FuseCodec: Semantic-Contextual Fusion and Supervision for Neural Codecs

Paper • 2509.11425 • Published Sep 14 • 3

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 266

upvoted 2 papers 7 months ago

Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues

Paper • 2506.00958 • Published Jun 1 • 20

Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks

Paper • 2505.11881 • Published May 17 • 4

upvoted a paper 9 months ago

VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms

Paper • 2503.14427 • Published Mar 18 • 19

upvoted 2 papers about 1 year ago

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published Nov 4, 2024 • 34

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 84