Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 44
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
llama.cpp
MLX LM
LM Studio
Ollama
Jan
Draw Things
+ 7
Inference Providers
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
+ 10
Apply filters
Models
19,722
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
moonshotai/Kimi-K2.5
Image-Text-to-Text
•
171B
•
Updated
8 days ago
•
648k
•
•
2.08k
internlm/Intern-S1-Pro
Image-Text-to-Text
•
Updated
about 16 hours ago
•
13.5k
•
248
AIDC-AI/Ovis2.6-30B-A3B
Image-Text-to-Text
•
31B
•
Updated
about 14 hours ago
•
49
deepseek-ai/DeepSeek-OCR-2
Image-Text-to-Text
•
3B
•
Updated
10 days ago
•
888k
•
741
PaddlePaddle/PaddleOCR-VL-1.5
Image-Text-to-Text
•
1.0B
•
Updated
14 days ago
•
12.7k
•
388
trillionlabs/gWorld-8B
Image-Text-to-Text
•
9B
•
Updated
9 days ago
•
497
•
48
google/medgemma-1.5-4b-it
Image-Text-to-Text
•
4B
•
Updated
20 days ago
•
173k
•
439
Qwen/Qwen3-VL-8B-Instruct
Image-Text-to-Text
•
9B
•
Updated
Oct 15, 2025
•
2.89M
•
•
751
lightonai/LightOnOCR-2-1B
Image-Text-to-Text
•
1B
•
Updated
10 days ago
•
201k
•
515
google/gemma-3-4b-it
Image-Text-to-Text
•
4B
•
Updated
Mar 21, 2025
•
1.17M
•
1.18k
google/translategemma-4b-it
Image-Text-to-Text
•
5B
•
Updated
15 days ago
•
128k
•
614
bakrianoo/arabic-legal-documents-ocr-1.0
Image-Text-to-Text
•
4B
•
Updated
8 days ago
•
1k
•
19
inclusionAI/UI-Venus-1.5-8B
Image-Text-to-Text
•
9B
•
Updated
1 day ago
•
255
•
18
LiquidAI/LFM2.5-VL-1.6B
Image-Text-to-Text
•
2B
•
Updated
7 days ago
•
94.7k
•
217
nvidia/Cosmos-Reason2-8B
Image-Text-to-Text
•
9B
•
Updated
13 days ago
•
192k
•
120
inclusionAI/UI-Venus-1.5-30B-A3B
Image-Text-to-Text
•
31B
•
Updated
1 day ago
•
185
•
16
openbmb/MiniCPM-V-4_5
Image-Text-to-Text
•
9B
•
Updated
Dec 18, 2025
•
74.3k
•
1.07k
p-e-w/gemma-3-12b-it-heretic
Image-Text-to-Text
•
12B
•
Updated
Nov 15, 2025
•
652
•
41
trillionlabs/gWorld-32B
Image-Text-to-Text
•
33B
•
Updated
9 days ago
•
264
•
26
inclusionAI/UI-Venus-1.5-2B
Image-Text-to-Text
•
2B
•
Updated
1 day ago
•
290
•
14
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text
•
3B
•
Updated
Nov 4, 2025
•
3.02M
•
3.14k
stepfun-ai/Step3-VL-10B
Image-Text-to-Text
•
10B
•
Updated
9 days ago
•
91.3k
•
394
Alibaba-DAMO-Academy/RynnBrain-2B
Image-Text-to-Text
•
2B
•
Updated
3 days ago
•
85
•
13
rednote-hilab/dots.ocr
Image-Text-to-Text
•
3B
•
Updated
Oct 31, 2025
•
260k
•
1.23k
Qwen/Qwen3-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Oct 23, 2025
•
1.82M
•
317
datalab-to/chandra
Image-Text-to-Text
•
9B
•
Updated
Oct 21, 2025
•
276k
•
483
google/gemma-3-27b-it
Image-Text-to-Text
•
27B
•
Updated
Mar 21, 2025
•
1.63M
•
•
1.86k
google/medgemma-4b-it
Image-Text-to-Text
•
4B
•
Updated
Oct 28, 2025
•
364k
•
887
Qwen/Qwen3-VL-4B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Oct 15, 2025
•
924k
•
331
ibm-granite/granite-docling-258M
Image-Text-to-Text
•
0.3B
•
Updated
Sep 23, 2025
•
205k
•
1.12k
Previous
1
2
3
...
100
Next