Benjy's Collections

Image-to-Text

updated Dec 24, 2024

  • microsoft/OmniParser

    Image-Text-to-Text • Updated Dec 2, 2024 • 339 • 1.7k

  • Qwen/Qwen2-VL-72B

    Image-Text-to-Text • 73B • Updated Dec 6, 2024 • 113 • 80

  • Qwen/Qwen2-VL-72B-Instruct

    Image-Text-to-Text • 73B • Updated Feb 6, 2025 • 19.4k • 308

  • Qwen/QVQ-72B-Preview

    Image-Text-to-Text • 73B • Updated Jan 12, 2025 • 234 • 609