Yufei Huang
huangyf530
AI & ML interests
Natural Language Processing, Question Answering
Organizations
None yet
LLM
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
Paper • 2311.08692 • Published • 13
DiLoCo: Distributed Low-Communication Training of Language Models
Paper • 2311.08105 • Published • 16
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 43
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Paper • 2312.06134 • Published • 3
Faithfulness
Reasoning
Contrastive Chain-of-Thought Prompting
Paper • 2311.09277 • Published • 35
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Paper • 2312.06585 • Published • 29
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
Paper • 2312.08926 • Published • 9
Efficiency
Explainability
Alignment