Leon Tsou

xxrjun

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

nvidia/DeepSeek-R1-0528-NVFP4:What does “AA Ref” mean in NVIDIA model benchmarks?

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

liked a model 3 months ago

deepseek-ai/DeepSeek-R1-0528

upvoted an article 4 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

Jun 12 • 151

upvoted 3 collections 11 months ago

InternVL2.5

Better than InternVL 2.0 • 19 items • Updated Sep 28 • 92

Taiwan LLM

Try out at twllm.com ! • 29 items • Updated Nov 6 • 51

DeepSeek-R1

10 items • Updated Nov 27 • 825

upvoted a paper about 1 year ago

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

Paper • 2402.03216 • Published Feb 5, 2024 • 6

upvoted a collection over 1 year ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 46

upvoted an article over 1 year ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Aug 26, 2024 • 82

upvoted a collection over 1 year ago

Jamba 1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Mar 6 • 87

upvoted 3 papers over 1 year ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 11

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 259

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Paper • 2408.06266 • Published Aug 12, 2024 • 10

upvoted a collection over 1 year ago

Preference Datasets for KTO

This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Dec 11, 2024 • 15

upvoted a paper over 1 year ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 64

upvoted 6 collections over 1 year ago

Video

Stability AI's suite of image-to-video models • 7 items • Updated Nov 14 • 88

Transformers compatible Mamba

This release includes the `mamba` repositories compatible with the `transformers` library • 5 items • Updated Mar 6, 2024 • 39

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 700

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22, 2024 • 44

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 875

upvoted an article over 1 year ago

Article

Everything About Long Context Fine-tuning

May 10, 2024 • 53