Shaleen Bhartiya

Shaleen123

https://brainwaveml.ai

AI & ML interests

None yet

Recent Activity

updated a dataset 12 days ago

BrainWave-ML/kara_think

published a dataset 13 days ago

BrainWave-ML/kara_think

upvoted a paper 16 days ago

mHC: Manifold-Constrained Hyper-Connections

upvoted a paper 16 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 17 days ago • 257

upvoted 3 papers 3 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 120

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 46

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 114

upvoted a collection 5 months ago

BrainWave-ML

Best Models in the Game! • 10 items • Updated Sep 3, 2025 • 1

upvoted a collection 10 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 684

upvoted 2 articles 10 months ago

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

Article • Feb 17, 2025 • 28

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • Mar 12, 2025 • 482

upvoted a paper about 1 year ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 99

upvoted a paper over 1 year ago

TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder

Paper • 2409.08248 • Published Sep 12, 2024 • 16

upvoted an article over 1 year ago

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Article • Jul 30, 2024 • 68

upvoted a paper over 1 year ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 117

upvoted a collection over 1 year ago

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 10, 2025 • 83

upvoted an article over 1 year ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 439

upvoted a paper over 1 year ago

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Paper • 2406.16855 • Published Jun 24, 2024 • 57

upvoted a collection over 1 year ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 882

upvoted a paper over 1 year ago

From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Paper • 2406.12824 • Published Jun 18, 2024 • 21

upvoted an article over 1 year ago

Diffusers welcomes Stable Diffusion 3

Article • Jun 12, 2024 • 99

upvoted 2 papers over 1 year ago

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Paper • 2402.00769 • Published Feb 1, 2024 • 22