rscr's picture

2 12

rscr

rscr

·

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 7 months ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5, 2025 • 133

upvoted an article 9 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

Sep 18, 2024

•

272