-
-
-
-
-
-
Inference Providers
Active filters:
nvidia
8B
•
Updated
•
872
•
25
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
•
32B
•
Updated
•
741k
•
•
266
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Text Generation
•
32B
•
Updated
•
469k
•
596
unsloth/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
•
32B
•
Updated
•
102k
•
234
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16
Text Generation
•
32B
•
Updated
•
33.7k
•
99
Image-Text-to-Text
•
9B
•
Updated
•
85.1k
•
84
nvidia/NVIDIA-Nemotron-Nano-9B-v2
Text Generation
•
9B
•
Updated
•
115k
•
468
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
•
Updated
•
12.5k
•
21
mradermacher/Huihui-NVIDIA-Nemotron-Nano-9B-v2-abliterated-i1-GGUF
9B
•
Updated
•
3.3k
•
8
nvidia/Nemotron-Cascade-8B
Text Generation
•
8B
•
Updated
•
4.06k
•
59
Image-Text-to-Text
•
8B
•
Updated
•
47.5k
•
232
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation
•
235B
•
Updated
•
245
•
20
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation
•
15B
•
Updated
•
7.49k
•
69
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
14.4k
•
15
nvidia/DeepSeek-V3.2-NVFP4
Text Generation
•
394B
•
Updated
•
1.21k
•
3
nvidia/Cosmos-Predict2-2B-Video2World
Image-to-Video
•
Updated
•
2.3k
•
38
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
•
133B
•
Updated
•
5.29k
•
13
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
•
16B
•
Updated
•
3.98k
•
6
Text Generation
•
5B
•
Updated
•
6.72k
•
12
nvidia/NVIDIA-Nemotron-Parse-v1.1
Image-Text-to-Text
•
Updated
•
104k
•
131
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
•
Updated
•
30.6k
•
19
nvidia/KVzap-mlp-Qwen3-8B
Other
•
75.7M
•
Updated
•
19.1k
•
3
nvidia/KVzap-mlp-Qwen3-32B
Other
•
0.2B
•
Updated
•
69
•
4
amihai4by/logic-reasoner-v2
Text Generation
•
8B
•
Updated
•
48
•
2
nvidia/NV-Llama2-13B-RLHF-RM
Text Generation
•
Updated
•
14
•
4
nvidia/Nemotron-Mini-4B-Instruct
Text Generation
•
Updated
•
13.3k
•
175
nvidia/AceMath-7B-Instruct
Text Generation
•
8B
•
Updated
•
703
•
•
30
Updated
•
13.5k
•
16
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation
•
8B
•
Updated
•
14.6k
•
•
216
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B
•
Updated
•
35.7k
•
19