56 296 296

Aymeric Roucher

m-ric

http://aymeric-roucher.github.io

AI & ML interests

Leading Agents at Hugging Face 🤗

Recent Activity

updated a Space about 15 hours ago

agents-course/README

upvoted a paper 2 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

upvoted an article 3 days ago

Fine-tune Llama 2 with DPO

View all activity

Articles

Organizations

m-ric's activity

updated a Space about 15 hours ago

Running

🌖

Agents Course

upvoted a paper 2 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 5 days ago • 223

upvoted an article 3 days ago

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 38

reacted to merve's post with 🔥 3 days ago

Post

3751

Oof, what a week! 🥵 So many things have happened, let's recap! merve/jan-24-releases-6793d610774073328eac67a9

Multimodal 💬
- We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG 💗
- UI-TARS are new models by ByteDance to unlock agentic GUI control 🤯 in 2B, 7B and 72B
- Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B
- MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context
- Dataset: Yale released a new benchmark called MMVU
- Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark

LLMs 📖
- DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! 🤯
- Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B
- NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!)

Audio 🗣️
- Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B
- TangoFlux is a new audio generation model trained from scratch and aligned with CRPO

Image/Video/3D Generation ⏯️
- Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux
- tencent released Hunyuan3D-2, new 3D asset generation from images

7 replies

posted an update 3 days ago

Post

2188

Today we make the biggest release in smolagents so far: 𝘄𝗲 𝗲𝗻𝗮𝗯𝗹𝗲 𝘃𝗶𝘀𝗶𝗼𝗻 𝗺𝗼𝗱𝗲𝗹𝘀, 𝘄𝗵𝗶𝗰𝗵 𝗮𝗹𝗹𝗼𝘄𝘀 𝘁𝗼 𝗯𝘂𝗶𝗹𝗱 𝗽𝗼𝘄𝗲𝗿𝗳𝘂𝗹 𝘄𝗲𝗯 𝗯𝗿𝗼𝘄𝘀𝗶𝗻𝗴 𝗮𝗴𝗲𝗻𝘁𝘀! 🥳

Our agents can now casually open up a web browser, and navigate on it by scrolling, clicking elements on the webpage, going back, just like a user would.

The demo below shows Claude-3.5-Sonnet browsing GitHub for task: "Find how many commits the author of the current top trending repo did over last year."
Hi @mlabonne !

Go try it out, it's the most cracked agentic stuff I've seen in a while 🤯 (well, along with OpenAI's Operator who beat us by one day)

For more detail, read our announcement blog 👉 https://huggingface.co./blog/smolagents-can-see
The code for the web browser example is here 👉 https://github.com/huggingface/smolagents/blob/main/examples/vlm_web_browser.py

3 replies

upvoted an article 3 days ago

Article

We now support VLMs in smolagents!

4 days ago

• 48

updated a dataset 3 days ago

huggingface/documentation-images

Viewer • Updated about 3 hours ago • 50 • 2.66M • 47

updated a Space 3 days ago

Running

🏢

Hf Model Downloads

upvoted an article 4 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

5 days ago

• 84

liked a model 6 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 1 day ago • 108k • • 598

upvoted an article 7 days ago

Article

Yay! Organizations can now publish blog Articles

•

7 days ago

• 30

liked a model 7 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 1 day ago • 149k • 3.61k

liked a Space 10 days ago

Running

356

📈

LLM Model VRAM Calculator

upvoted 3 articles 10 days ago

Article

The Large Language Model Course

•

12 days ago

• 77

Article

Diving into MiniMax01 405B MoE

•

13 days ago

• 17

Article

Gradio spaces are the perfect agent tools\!

•

11 days ago

• 12

commented on Alpine Agent: An AI Agent to Navigate Your Winter Mountain Adventures 10 days ago

Love this use-case! Will be very handy for the next Instagram shooting!

reacted to florentgbelidji's post with 🔥 10 days ago

Post

1361

𝗣𝗹𝗮𝗻𝗻𝗶𝗻𝗴 𝗬𝗼𝘂𝗿 𝗡𝗲𝘅𝘁 𝗦𝗸𝗶 𝗔𝗱𝘃𝗲𝗻𝘁𝘂𝗿𝗲 𝗝𝘂𝘀𝘁 𝗚𝗼𝘁 𝗦𝗺𝗮𝗿𝘁𝗲𝗿: 𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗔𝗹𝗽𝗶𝗻𝗲 𝗔𝗴𝗲𝗻𝘁!🏔️⛷️

With the big hype around AI agents these days, I couldn’t stop thinking about how AI agents could truly enhance real-world activities.
What sort of applications could we build with those AI agents: agentic RAG? self-correcting text-to-sql? Nah, boring…

Passionate about outdoors, I’ve always dreamed of a tool that could simplify planning mountain trips while accounting for all potential risks. That’s why I built 𝗔𝗹𝗽𝗶𝗻𝗲 𝗔𝗴𝗲𝗻𝘁, a smart assistant designed to help you plan safe and enjoyable itineraries in the French Alps and Pyrenees.

Built using Hugging Face's 𝘀𝗺𝗼𝗹𝗮𝗴𝗲𝗻𝘁𝘀 library, Alpine Agent combines the power of AI with trusted resources like 𝘚𝘬𝘪𝘵𝘰𝘶𝘳.𝘧𝘳 (https://skitour.fr/) and METEO FRANCE. Whether it’s suggesting a route with moderate difficulty or analyzing avalanche risks and weather conditions, this agent dynamically integrates data to deliver personalized recommendations.

In my latest blog post, I share how I developed this project—from defining tools and integrating APIs to selecting the best LLMs like 𝘘𝘸𝘦𝘯2.5-𝘊𝘰𝘥𝘦𝘳-32𝘉-𝘐𝘯𝘴𝘵𝘳𝘶𝘤𝘵, 𝘓𝘭𝘢𝘮𝘢-3.3-70𝘉-𝘐𝘯𝘴𝘵𝘳𝘶𝘤𝘵, or 𝘎𝘗𝘛-4.

⛷️ Curious how AI can enhance adventure planning? Try the app and share your thoughts: florentgbelidji/alpine-agent

👉 Want to build your own agents? Whether for cooking, sports training, or other passions, the possibilities are endless. Check out the blog post to learn more: https://huggingface.co./blog/florentgbelidji/alpine-agent

Many thanks to @m-ric for helping on building this tool with smolagents!

1 reply

upvoted an article 10 days ago

Article

Alpine Agent: An AI Agent to Navigate Your Winter Mountain Adventures

•

10 days ago

• 3

liked a model 11 days ago

internlm/internlm3-8b-instruct

Text Generation • Updated 12 days ago • 16.8k • 188

Aymeric Roucher

AI & ML interests

Recent Activity

Articles

We now support VLMs in smolagents!

Introducing smolagents: simple agents that write actions in code.

Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge

Our Transformers Code Agent beats the GAIA benchmark!

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

License to Call: Introducing Transformers Agents 2.0

Open-source LLMs as LangChain Agents

Organizations

m-ric's activity

Agents Course

Fine-tune Llama 2 with DPO

We now support VLMs in smolagents!

Hf Model Downloads

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Yay! Organizations can now publish blog Articles

LLM Model VRAM Calculator

The Large Language Model Course

Diving into MiniMax01 405B MoE

Gradio spaces are the perfect agent tools\!

Alpine Agent: An AI Agent to Navigate Your Winter Mountain Adventures