-
-
-
-
-
-
Inference Providers
Active filters:
open-r1
Qucy/Qwen2.5-0.5B-Open-R1-Distill
Text Generation
•
0.5B
•
Updated
•
2
ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k2
Text Generation
•
2B
•
Updated
•
1
ashokvaktariya1/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
1
hwang595/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
1
skzxjus/Qwen-2.5-7B-Simple-RL
Text Generation
•
8B
•
Updated
•
2
bruel/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
3
foxiwift/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
2
JeffP111/Qwen-2.5-3B-Simple-RL
Text Generation
•
3B
•
Updated
•
6
nzy123/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
0.5B
•
Updated
•
11
Thomas-Chou/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
12
JasonWangVG/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
1
Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
5
princepride/Qwen-2.5-7B-Simple-RL-Test
Text Generation
•
8B
•
Updated
•
4
Qucy/Qwen2.5-0.5B-Open-R1-Distill-Fin
Text Generation
•
0.5B
•
Updated
•
7
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer
Text Generation
•
8B
•
Updated
•
1
yolay/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Lansechen/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
3
•
1
ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
2
HaichuanWang/Qwen2.5-32B-Open-R1-Distill
Text Generation
•
8B
•
Updated
•
2
allendou/Qwen2.5-0.5B-Open-R1-Distill
Text Generation
•
0.5B
•
Updated
mradermacher/Quasar-2.0-7B-GGUF
8B
•
Updated
•
42
•
1
bluuluu/Qwen2.5-1.5B-Open-R1-Distill
coco3143/Qwen2.5-0.5B-Open-R1-GRPO
Text Generation
•
0.5B
•
Updated
•
2
Penghe/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Cys621/Qwen2.5-0.5B-Open-R1-GRPO
Text Generation
•
0.5B
•
Updated
•
5
•
princepride/Qwen-2.5-7B-Simple-RL
Text Generation
•
8B
•
Updated
•
3
mradermacher/Quasar-2.0-7B-i1-GGUF
8B
•
Updated
•
21
•
1
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata
Text Generation
•
8B
•
Updated
•
2
namazifar/Qwen-2.5-7B-Simple-RL
Text Generation
•
9B
•
Updated
AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx
Text Generation
•
2B
•
Updated
•
65
•
3