Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Yufei Huang's picture

3 3

Yufei Huang

huangyf530

21world's profile picture

AdinaY's profile picture

·

https://huangyf530.github.io

yufei_huang1999
huangyf530

AI & ML interests

Natural Language Processing, Question Answering

Organizations

None yet

huangyf530 's collections 6

Trusted Source Alignment in Large Language Models

Paper • 2311.06697 • Published Nov 12, 2023 • 12
Fine-tuning Language Models for Factuality

Paper • 2311.08401 • Published Nov 14, 2023 • 30

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

Paper • 2311.08692 • Published Nov 15, 2023 • 13
DiLoCo: Distributed Low-Communication Training of Language Models

Paper • 2311.08105 • Published Nov 14, 2023 • 16
System 2 Attention (is something you might need too)

Paper • 2311.11829 • Published Nov 20, 2023 • 43
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

Paper • 2312.06134 • Published Dec 11, 2023 • 3

Unlocking Anticipatory Text Generation: A Constrained Approach for Faithful Decoding with Large Language Models

Paper • 2312.06149 • Published Dec 11, 2023 • 3
Context Tuning for Retrieval Augmented Generation

Paper • 2312.05708 • Published Dec 9, 2023 • 16

Contrastive Chain-of-Thought Prompting

Paper • 2311.09277 • Published Nov 15, 2023 • 35
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Paper • 2312.06585 • Published Dec 11, 2023 • 29
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent

Paper • 2312.08926 • Published Dec 14, 2023 • 9

Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster

Paper • 2311.08263 • Published Nov 14, 2023 • 16
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

Paper • 2312.07987 • Published Dec 13, 2023 • 41

Using Captum to Explain Generative Language Models

Paper • 2312.05491 • Published Dec 9, 2023 • 4

Trusted Source Alignment in Large Language Models

Paper • 2311.06697 • Published Nov 12, 2023 • 12
Fine-tuning Language Models for Factuality

Paper • 2311.08401 • Published Nov 14, 2023 • 30

Contrastive Chain-of-Thought Prompting

Paper • 2311.09277 • Published Nov 15, 2023 • 35
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Paper • 2312.06585 • Published Dec 11, 2023 • 29
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent

Paper • 2312.08926 • Published Dec 14, 2023 • 9

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

Paper • 2311.08692 • Published Nov 15, 2023 • 13
DiLoCo: Distributed Low-Communication Training of Language Models

Paper • 2311.08105 • Published Nov 14, 2023 • 16
System 2 Attention (is something you might need too)

Paper • 2311.11829 • Published Nov 20, 2023 • 43
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

Paper • 2312.06134 • Published Dec 11, 2023 • 3

Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster

Paper • 2311.08263 • Published Nov 14, 2023 • 16
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

Paper • 2312.07987 • Published Dec 13, 2023 • 41

Unlocking Anticipatory Text Generation: A Constrained Approach for Faithful Decoding with Large Language Models

Paper • 2312.06149 • Published Dec 11, 2023 • 3
Context Tuning for Retrieval Augmented Generation

Paper • 2312.05708 • Published Dec 9, 2023 • 16

Using Captum to Explain Generative Language Models

Paper • 2312.05491 • Published Dec 9, 2023 • 4

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs