view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29 • 203
MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning Paper • 2504.10160 • Published Apr 14 • 2
AI-Agent-4-Industry-4.0 Collection This category highlights the collective efforts of the AI Automation team in advancing Industry 4.0 applications and exploring innovations beyond it. • 6 items • Updated Oct 8 • 6
xLAM models Collection xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 22 items • Updated Nov 5 • 59
Reasoning Efficiency Research Collection Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs • 3 items • Updated 8 days ago • 10
SmolLM3 pretraining datasets Collection datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12 • 40
Instruction-Following Evaluation in Function Calling for Large Language Models Paper • 2509.18420 • Published Sep 22 • 1
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 Sep 18, 2024 • 273
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 175