S.F.

search-facility

ipv6

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Representation Alignment for Just Image Transformers is not Easier than You Think

upvoted a paper 2 days ago

Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting

upvoted a paper 2 days ago

AVControl: Efficient Framework for Training Audio-Visual Controls

Organizations

None yet

upvoted 5 papers 2 days ago

Representation Alignment for Just Image Transformers is not Easier than You Think

Paper • 2603.14366 • Published 15 days ago • 9

upvoted a paper 5 days ago

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published 7 days ago • 44

upvoted a paper 6 days ago

FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow

Paper • 2603.19598 • Published 10 days ago • 32

upvoted 2 papers 10 days ago

LoST: Level of Semantics Tokenization for 3D Shapes

Paper • 2603.17995 • Published 12 days ago • 31

Complementary Reinforcement Learning

Paper • 2603.17621 • Published 12 days ago • 36

upvoted a paper 11 days ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 13 days ago • 303

upvoted a paper 12 days ago

Mixture-of-Depths Attention

Paper • 2603.15619 • Published 14 days ago • 79

commented on a paper 12 days ago

Attention Residuals

Paper • 2603.15031 • Published 14 days ago • 167

upvoted 2 papers 12 days ago

Attention Residuals

Paper • 2603.15031 • Published 14 days ago • 167

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 14 days ago • 149

upvoted a paper 13 days ago

OmniForcing: Unleashing Real-time Joint Audio-Visual Generation

Paper • 2603.11647 • Published 18 days ago • 31

upvoted 3 papers 18 days ago

Fish Audio S2 Technical Report

Paper • 2603.08823 • Published 21 days ago • 36

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published 20 days ago • 28

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

Paper • 2603.06577 • Published 24 days ago • 48

upvoted 2 papers 19 days ago

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published 22 days ago • 84

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published 24 days ago • 91
