Elsayed Mohamed

sayedM

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago
Qwen/Qwen2.5-VL-72B-Instruct
liked a model 2 days ago
OmarSamir/EGTTS-V0.1
liked a Space 2 days ago
deepseek-ai/Janus-Pro-7B

Organizations

Pxivision

sayedM's activity

liked a Space 4 days ago
upvoted an article 4 days ago
Article

We now support VLMs in smolagents!

62
reacted to m-ric's post with 🔥 4 days ago
Post
2368
Today we make the biggest release in smolagents so far: we enable vision models, which makes it possible to build powerful web browsing agents! 🥳

Our agents can now casually open up a web browser, and navigate on it by scrolling, clicking elements on the webpage, going back, just like a user would.
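To make the action set above concrete, here is a minimal toy sketch of an agent driving a browser through scroll, click, and go-back steps. It is not the smolagents API and does not open a real browser: `ToyBrowser`, `run_actions`, and the tiny `site` map are hypothetical names invented for illustration, and a real agent would get each action from a vision model looking at a screenshot instead of from a fixed list.

```python
from dataclasses import dataclass, field


@dataclass
class ToyBrowser:
    """Hypothetical in-memory stand-in for a real browser session."""
    links: dict  # page name -> {link text: target page name}
    url: str = "home"
    scroll_y: int = 0
    history: list = field(default_factory=list)

    def scroll(self, pixels: int) -> None:
        # Clamp at the top of the page (negative pixels scroll up).
        self.scroll_y = max(0, self.scroll_y + pixels)

    def click(self, text: str) -> None:
        # Follow a link on the current page and remember where we came from.
        target = self.links[self.url][text]
        self.history.append(self.url)
        self.url = target
        self.scroll_y = 0

    def go_back(self) -> None:
        # Return to the previous page, if there is one.
        if self.history:
            self.url = self.history.pop()
            self.scroll_y = 0


def run_actions(browser, actions):
    """Apply (action_name, argument) pairs in order, as an agent loop might after each model step."""
    for name, arg in actions:
        if name == "scroll":
            browser.scroll(arg)
        elif name == "click":
            browser.click(arg)
        elif name == "go_back":
            browser.go_back()
        else:
            raise ValueError(f"unknown action: {name}")
    return browser


# Assumed three-page site, purely for illustration.
site = {"home": {"Trending": "trending"}, "trending": {"Top repo": "repo"}, "repo": {}}
steps = [("scroll", 300), ("click", "Trending"), ("click", "Top repo"), ("go_back", None)]
browser = run_actions(ToyBrowser(site), steps)
print(browser.url, browser.history)
```

The point of the sketch is only that "just like a user would" boils down to a small, explicit set of actions plus a history stack; the linked example script is the place to look for how smolagents actually wires this up.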

The demo below shows Claude-3.5-Sonnet browsing GitHub for the task: "Find how many commits the author of the current top trending repo did over last year."
Hi @mlabonne !

Go try it out, it's the most cracked agentic stuff I've seen in a while 🤯 (well, along with OpenAI's Operator, which beat us by one day)

For more detail, read our announcement blog 👉 https://huggingface.co/blog/smolagents-can-see
The code for the web browser example is here 👉 https://github.com/huggingface/smolagents/blob/main/examples/vlm_web_browser.py
upvoted an article 4 days ago
Article

Introducing smolagents: simple agents that write actions in code.

533
reacted to alibabasglab's post with 👍 8 days ago
updated a Space about 1 month ago
liked a Space about 2 months ago