47 33 81

Kashif Rasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

liked a Space 8 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

liked a Space 10 days ago

autogluon/fev-leaderboard

liked a model 14 days ago

nicolas-dufour/PLONK_YFCC

View all activity

Articles

Organizations

kashif's activity

upvoted a paper 20 days ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24 • 5

upvoted a paper 21 days ago

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20 • 4

upvoted 2 papers 3 months ago

A Rate-Distortion View of Uncertainty Quantification

Paper • 2406.10775 • Published Jun 16 • 1

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted a paper 4 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 11

upvoted a collection 4 months ago

Power-LM

Collection

Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17 • 15

upvoted 2 papers 4 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 21

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 117

upvoted a paper 5 months ago

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Paper • 2405.21046 • Published May 31 • 4

upvoted 4 articles 6 months ago

Article

Putting RL back in RLHF

Jun 12

• 65

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12

• 92

Article

The Annotated Diffusion Model

Jun 7, 2022

• 123

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 124

upvoted 3 papers 6 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 87

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published Jun 14 • 23

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10 • 12

upvoted 2 papers 9 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 63

VILA: On Pre-training for Visual Language Models

Paper • 2312.07533 • Published Dec 12, 2023 • 20

upvoted a collection 9 months ago

Moirai-R models

Collection

10 items • Updated 6 days ago • 36

upvoted a collection 10 months ago

Chronos Models & Datasets

Collection

Collection of artifacts related to Chronos pretrained models for time series forecasting. • 12 items • Updated 29 days ago • 36

Kashif Rasul

AI & ML interests

Recent Activity

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

🧨 Diffusers welcomes Stable Diffusion 3

Patch Time Series Transformer in Hugging Face

Constitutional AI with Open LLMs

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model

Organizations

kashif's activity

Putting RL back in RLHF

🧨 Diffusers welcomes Stable Diffusion 3

The Annotated Diffusion Model

Welcome Gemma 2 - Google's new open LLM