SmolVLM realtime WebGPU ⚡ • Running • Featured • 160 • Start camera to get descriptions based on instructions
Janus Pro WebGPU 🏛 • Running • Featured • 216 • In-browser unified multimodal understanding and generation.
meta-llama/Llama-3.2-11B-Vision-Instruct • Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 104k • 1.55k