Nishith Jain's picture

Nishith Jain

KingNish

·

AI & ML interests

AI is fun actually. Busy till June 2025.

Recent Activity

upvoted a paper about 11 hours ago

Llamba: Scaling Distilled Recurrent Models for Efficient Language Processing

new activity about 12 hours ago

KingNish/qwen-1b-continued-v2.2:Adding `safetensors` variant of this model

updated a model 1 day ago

KingNish/qwen-1b-continued-v2.2

View all activity

Organizations

KingNish's activity

upvoted a paper about 11 hours ago

Llamba: Scaling Distilled Recurrent Models for Efficient Language Processing

Paper • 2502.14458 • Published 17 days ago • 2

upvoted an article 3 days ago

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 188

upvoted a paper 5 days ago

SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers

Paper • 2502.20545 • Published 10 days ago • 20

upvoted an article 11 days ago

Article

FastRTC: The Real-Time Communication Library for Python

13 days ago

• 130

upvoted an article 13 days ago

Article

Remote VAEs for decoding with HF endpoints 🤗

14 days ago

• 34

upvoted 2 papers 13 days ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19, 2024 • 55

Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding

Paper • 2402.05109 • Published Feb 7, 2024 • 1

upvoted an article 14 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

18 days ago

• 196

upvoted an article 16 days ago

Article

The Large Language Model Course

By

•

Jan 16

• 126

upvoted a paper 19 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 101

upvoted an article 25 days ago

Article

Fine-tune ModernBERT for text classification using synthetic data

By

•

Dec 30, 2024

• 32

upvoted a paper 27 days ago

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 22

upvoted 2 papers about 1 month ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 57

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

Paper • 2402.14207 • Published Feb 22, 2024 • 8

upvoted an article about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.14k

upvoted a paper about 1 month ago

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31 • 21

upvoted an article about 1 month ago

Article

🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces

By

•

Feb 2

• 5

upvoted a paper about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 109

upvoted an article about 1 month ago

Article

They Said It Couldn’t Be Done

By

and 2 others •

Dec 5, 2024

• 82

upvoted a collection about 1 month ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 120