Clem 🤗's picture

Clem 🤗 PRO

clem

·

http://huggingface.co

AI & ML interests

multi-modal, time-series, biology and chemistry

Recent Activity

liked a model about 8 hours ago

deepseek-ai/Janus-Pro-7B

liked a model about 9 hours ago

deepseek-ai/Janus-Pro-1B

reacted to fdaudens's post with ❤️ about 10 hours ago

Yes, DeepSeek R1's release is impressive. But the real story is what happened in just 7 days after: - Original release: 8 models, 540K downloads. Just the beginning... - The community turned those open-weight models into +550 NEW models on Hugging Face. Total downloads? 2.5M—nearly 5X the originals. The reason? DeepSeek models are open-weight, letting anyone build on top of them. Interesting to note that the community focused on quantized versions for better efficiency & accessibility. They want models that use less memory, run faster, and are more energy-efficient. When you empower builders, innovation explodes. For everyone. 🚀 The most popular community model? @bartowski's DeepSeek-R1-Distill-Qwen-32B-GGUF version — 1M downloads alone.

View all activity

Organizations

clem's activity

upvoted a paper 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 5 days ago • 224

upvoted an article 5 days ago

Article

Hugging Face and FriendliAI partner to supercharge model deployment on the Hub

6 days ago

• 29

upvoted a collection 6 days ago

DeepSeek-R1

8 items • Updated 7 days ago • 205

upvoted a paper 7 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 11 days ago • 98

upvoted an article 7 days ago

Article

Yay! Organizations can now publish blog Articles

By

•

7 days ago

• 30

upvoted a collection 12 days ago

2025 Artifacts

20 items • Updated 1 day ago • 4

upvoted a collection 14 days ago

DeepSeek-V3

3 items • Updated 22 days ago • 147

upvoted an article 14 days ago

Article

🌁#83: GAN is back

By

•

14 days ago

• 7

upvoted an article 16 days ago

Article

🦸🏻#7: From Agentic AI to Physical AI

By

•

16 days ago

• 6

upvoted 2 papers 18 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 24 days ago • 87

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 19 days ago • 89

upvoted a collection 18 days ago

TACO Models

This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated Dec 20, 2024 • 8

upvoted 3 papers 18 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 81

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 21 days ago • 66

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 36

upvoted a collection 18 days ago

My Loras

loras i've made that are hosted here • 12 items • Updated 13 days ago • 6

upvoted a paper 18 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 20 days ago • 249

upvoted an article 22 days ago

Article

Inference Endpoints Changelog 🚀

By

•

Oct 11, 2024

• 20

upvoted a paper about 1 month ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125

upvoted a collection about 1 month ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 127