Collections

Discover the best community collections!

Collections including paper arxiv:2501.04001
cabinet
Collection by Sep 8
Image-Video General Tasks
Collection by Nov 3
Sa2VA Model Zoo
Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research
VLM papers
Collection by Mar 9
VLM
Collection by Jan 14
Papers
Collection by about 8 hours ago
Multimodal Language Model
What does matter besides data receipt when training a Multimodal language model?
cabinet
Collection by Sep 8
Image-Video General Tasks
Collection by Nov 3
VLM
Collection by Jan 14
Sa2VA Model Zoo
Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research
Papers
Collection by about 8 hours ago
VLM papers
Collection by Mar 9
Multimodal Language Model
What does matter besides data receipt when training a Multimodal language model?