BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation Paper β’ 2402.03216 β’ Published Feb 5, 2024 β’ 6
Preference Datasets for DPO Collection This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs β’ 7 items β’ Updated Dec 11, 2024 β’ 46
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques π π Aug 26, 2024 β’ 82
Jamba 1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models β’ 2 items β’ Updated Mar 6 β’ 87
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published Apr 22, 2024 β’ 259
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Paper β’ 2408.06266 β’ Published Aug 12, 2024 β’ 10
Preference Datasets for KTO Collection This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. β’ 5 items β’ Updated Dec 11, 2024 β’ 15
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper β’ 2305.18290 β’ Published May 29, 2023 β’ 64
Transformers compatible Mamba Collection This release includes the `mamba` repositories compatible with the `transformers` library β’ 5 items β’ Updated Mar 6, 2024 β’ 39
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated May 1 β’ 574
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models β’ 11 items β’ Updated Dec 6, 2024 β’ 700
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Dec 6, 2024 β’ 875