Kashif Rasul's picture

Kashif Rasul

kashif

·

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

liked a Space 2 days ago

multimodalart/LLaDA-2-1

new activity 3 days ago

kuleshov-group/bd3lm-owt-block_size16:Add post_init() and register_buffer(persistent=False) for transformers v5

new activity 3 days ago

kuleshov-group/bd3lm-owt-block_size8:Add post_init() and register_buffer(persistent=False) for transformers v5

View all activity

Organizations

published an article 19 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

19 days ago

•

79

published an article 20 days ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

20 days ago

•

24

published an article 8 months ago

Article

Vision Language Model Alignment in TRL ⚡️

+3

Aug 7, 2025

•

109

published an article 9 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

767

published an article 9 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

+6

Jun 26, 2025

•

121

published an article 10 months ago

Article

KV Cache from scratch in nanoVLM

+3

Jun 4, 2025

•

114

published an article 10 months ago

Article

🐯 Liger GRPO meets TRL

+4

May 25, 2025

•

53

published an article over 1 year ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

+6

Jul 11, 2024

•

128

published an article over 1 year ago

Article

Preference Optimization for Vision Language Models

+2

Jul 10, 2024

•

93

published an article almost 2 years ago

Article

Diffusers welcomes Stable Diffusion 3

+4

Jun 12, 2024

•

99

published an article about 2 years ago

Article

Constitutional AI with Open LLMs

+5

Feb 1, 2024

•

17

published an article about 2 years ago

Article

Patch Time Series Transformer in Hugging Face

+3

Feb 1, 2024

•

14

published an article about 2 years ago

Article

Constitutional AI with Open LLMs

+5

Feb 1, 2024

•

17

published an article about 2 years ago

Article

PatchTSMixer in HuggingFace

+4

Jan 19, 2024

•

10

published an article about 2 years ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

+3

Jan 18, 2024

•

80

published an article over 2 years ago

Article

Finetune Stable Diffusion Models with DDPO via TRL

+2

Sep 29, 2023

•

20

published an article over 2 years ago

Article

Finetune Stable Diffusion Models with DDPO via TRL

+2

Sep 29, 2023

•

20

published an article over 2 years ago

Article

Introducing Würstchen: Fast Diffusion for Image Generation

+3

Sep 13, 2023

•

21

published an article over 2 years ago

Article

Fine-tune Llama 2 with DPO

+1

Aug 8, 2023

•

68