MathRL Collection Note: The solution may not be in `solution` or `answer` columns, but inside /boxed/{ANSWER} • 13 items • Updated Aug 16 • 1
Reward Models 06-2025 Collection Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 7 days ago • 22
Models I WIll GGUF Collection MODELS MUST BE <=22B. To add to this open this link: https://huggingface.co/collections/ReallyFloppyPenguin/models2gguflater-68503439edc1aa25cce7c79b • 0 items • Updated Jun 23 • 1
On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7 • 82
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 188
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 246