Inference Providers
Active filters: nvidia
nvidia/Kimi-K2.5-Thinking-Eagle3
Text Generation
• Updated • 508
• 10
cyankiwi/NVIDIA-Nemotron-3-Super-120B-A12B-AWQ-4bit
Text Generation
• 127B • Updated • 2.74k
• 3
nvidia/nemotron-3-8b-base-4k
Text Generation
• Updated • 1
• 101
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation
• 8B • Updated • 283k
• • 221
nvidia/Cosmos-Predict2-2B-Text2Image
Text-to-Image
• Updated • 164
• 72
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
• 16B • Updated • 32k
• 19
nvidia/Nemotron-Terminal-8B
Text Generation
• 8B • Updated • 2.88k
• 28
nvidia/Nemotron-Terminal-14B
Text Generation
• 15B • Updated • 510
• 10
nvidia/Nemotron-Terminal-32B
Text Generation
• 33B • Updated • 1.45k
• 34
Updated • 796
• 2
berkerdooo/Qwen3.5-27B-NVFP4
Image-Text-to-Text
• 17B • Updated • 1.36k
• 2
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16
Text Generation
• 124B • Updated • 9.54k
• 21
bartowski/nvidia_Nemotron-3-Super-120B-A12B-GGUF
Text Generation
• 121B • Updated • 10.7k
• 6
unsloth/NVIDIA-Nemotron-3-Nano-4B-FP8
Text Generation
• 4B • Updated • 587
• 2
Updated • 29
• 2
nvidia/Kimodo-SMPLX-RP-v1
Updated • 103
• 2
mradermacher/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-heretic-GGUF
121B • Updated • 3.47k
• 2
bartowski/nvidia_Nemotron-3-Nano-4B-GGUF
Text Generation
• 4B • Updated • 2.29k
• 2
mradermacher/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-heretic-i1-GGUF
121B • Updated • 20.3k
• 2
mradermacher/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16-heretic-i1-GGUF
32B • Updated • 3.71k
• 2
mlx-community/Nemotron-Cascade-2-30B-A3B-8bit
Text Generation
• 32B • Updated • 646
• 2
freddm/Nemotron-Cascade-2-30B-A3B-GGUF
Text Generation
• 32B • Updated • 724
• 2
nvidia/Nemotron-Mini-4B-Instruct
Text Generation
• Updated • 13.7k
• 178
nvidia/OpenMath2-Llama3.1-8B-nemo
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation
• 71B • Updated • 11k
• 2.06k
bartowski/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF
Text Generation
• 71B • Updated • 4.28k
• 103
nvidia/Nemotron-H-8B-Base-8K
Text Generation
• Updated • 18.1k
• 55
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B • Updated • 49.5k
• 27
nvidia/Nemotron-H-4B-Instruct-128K
Text Generation
• 4B • Updated • 64.2k
• 9