Benjy's Collections
Image-to-Text
Updated Dec 24, 2024
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 339 • 1.7k
Qwen/Qwen2-VL-72B • Image-Text-to-Text • 73B • Updated Dec 6, 2024 • 113 • 80
Qwen/Qwen2-VL-72B-Instruct • Image-Text-to-Text • 73B • Updated Feb 6, 2025 • 19.4k • 308
Qwen/QVQ-72B-Preview • Image-Text-to-Text • 73B • Updated Jan 12, 2025 • 234 • 609