3 87 87

Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

reacted to onekq's post with 👍 about 3 hours ago

QwQ-32B is amazing! It ranks below o1-preview, but beats DeepSeek v3 and all Gemini models. https://huggingface.co./spaces/onekq-ai/WebApp1K-models-leaderboard Now we have such a powerful model that can fit into a single GPU, can someone finetune a web app model to push SOTA of my leaderboard? 🤗

reacted to clem's post with 🔥 about 20 hours ago

I was chatting with @peakji , one of the cofounders of Manu AI, who told me he was on Hugging Face (very cool!). He shared an interesting insight which is that agentic capabilities might be more of an alignment problem rather than a foundational capability issue. Similar to the difference between GPT-3 and InstructGPT, some open-source foundation models are simply trained to 'answer everything in one response regardless of the complexity of the question' - after all, that's the user preference in chatbot use cases. Just a bit of post-training on agentic trajectories can make an immediate and dramatic difference. As a thank you to the community, he shared 100 invite code first-come first serve, just use “HUGGINGFACE” to get access!

upvoted a paper 2 days ago

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

View all activity

Organizations

theainerd's activity

upvoted a paper 2 days ago

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Paper • 2503.00865 • Published 7 days ago • 55

upvoted an article 10 days ago

Article

SigLIP 2: A better multilingual vision language encoder

17 days ago

• 126

upvoted a paper 13 days ago

LightThinker: Thinking Step-by-Step Compression

Paper • 2502.15589 • Published 16 days ago • 26

upvoted 2 papers 15 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 17 days ago • 128

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 17 days ago • 177

upvoted a paper 16 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 17 days ago • 94

upvoted 2 papers 17 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published 19 days ago • 14

upvoted a paper 20 days ago

Jailbreaking to Jailbreak

Paper • 2502.09638 • Published 28 days ago • 4

upvoted a paper 21 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 25 days ago • 143

upvoted an article 25 days ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.14k

upvoted an article 26 days ago

Article

Open R1: Update #2

and 6 others •

27 days ago

• 197

upvoted a paper 29 days ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published Feb 5 • 43

upvoted 2 articles about 1 month ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 416

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 293

upvoted a collection about 1 month ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 14 items • Updated 1 day ago • 90

upvoted a paper about 1 month ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 795

upvoted a paper about 1 month ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 65

upvoted a collection about 1 month ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 11 days ago • 105