Yong Lin's picture

7 1

Yong Lin

linyongver

·

AI & ML interests

None yet

Organizations

upvoted a paper 3 months ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13, 2025 • 25

upvoted 2 papers 7 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

Paper • 2506.18945 • Published Jun 23, 2025 • 40

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143

upvoted a paper 9 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93

upvoted a paper 10 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2, 2025 • 56

upvoted a paper over 1 year ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 71

upvoted a collection over 1 year ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 26