ocr
tencent/HunyuanOCR • Image-Text-to-Text • Updated Jan 13 • 421k • 554
opendatalab/MinerU2.5-2509-1.2B • Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 90.3k • 340
PaddlePaddle/PaddleOCR-VL-1.5 • Image-Text-to-Text • 1.0B • Updated 2 days ago • 28.4k • 484
PaddlePaddle/PaddleOCR-VL • Image-Text-to-Text • 1.0B • Updated 2 days ago • 8.16k • 1.57k
asr
FireRedTeam/FireRedASR-AED-L • Automatic Speech Recognition • Updated Mar 5, 2025 • 130 • 67
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 313k • 1.57k