Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yong Lin's picture
7 1

Yong Lin

linyongver
ljupco's profile picture bmorphism's profile picture 21world's profile picture
·

AI & ML interests

None yet

Organizations

Directional Preference Alignment's profile picture Goedel-LM's profile picture

upvoted a paper 3 months ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13, 2025 • 25
upvoted a paper 6 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

Paper • 2506.18945 • Published Jun 23, 2025 • 40
upvoted a paper 7 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143
upvoted a paper 9 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93
upvoted a paper 10 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2, 2025 • 56
upvoted a paper over 1 year ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 71
upvoted a collection over 1 year ago

Standard-format-preference-dataset

Collection
We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 26
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs