hamadbijarani012/TinyBERT_General_4L_312D_distilled_BERT_base_uncased_SST2_int8 Text Classification • 14.4M • Updated 19 days ago • 31
hamadbijarani012/TinyBERT_General_4L_312D_distilled_BERT_base_uncased_SST2_int8 Text Classification • 14.4M • Updated 19 days ago • 31
hamadbijarani012/TinyBERT_General_4L_312D_pretrained_SST2 Text Classification • 14.4M • Updated about 1 month ago • 90
hamadbijarani012/TinyBERT_General_4L_312D_pretrained_SST2 Text Classification • 14.4M • Updated about 1 month ago • 90
hamadbijarani012/TinyBERT_General_4L_312D_distilled_BERT_base_uncased_SST2 Text Classification • 14.4M • Updated Dec 6, 2025 • 14
hamadbijarani012/TinyBERT_General_4L_312D_distilled_BERT_base_uncased_SST2 Text Classification • 14.4M • Updated Dec 6, 2025 • 14
Running on CPU Upgrade Featured 2.81k The Smol Training Playbook 📚 2.81k The secrets to building world-class LLMs
LLMs4All: A Review on Large Language Models for Research and Applications in Academic Disciplines Paper • 2509.19580 • Published Sep 23, 2025 • 13
rStar2-Agent: Agentic Reasoning Technical Report Paper • 2508.20722 • Published Aug 28, 2025 • 116
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published Aug 13, 2025 • 53
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 180