35 18 20

vansin

vansin

AI & ML interests

None yet

Recent Activity

commented on a paper 4 days ago

Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models

commented on a paper 4 days ago

FLAME: A Federated Learning Benchmark for Robotic Manipulation

commented on a paper 4 days ago

Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection

View all activity

Organizations

vansin's activity

reacted to their post with 🚀🔥 3 months ago

Post

869

Try InternThinker~

https://internlm-chat.intern-ai.org.cn/internthinker

posted an update 3 months ago

Post

869

Try InternThinker~

https://internlm-chat.intern-ai.org.cn/internthinker

reacted to their post with 👀 3 months ago

Post

1270

Amazing !!!! test Post

reacted to loubnabnl's post with 🔥 3 months ago

Post

2969

Making SmolLM2 reproducible: open-sourcing our training & evaluation toolkit 🛠️ https://github.com/huggingface/smollm/

- Pre-training code with nanotron
- Evaluation suite with lighteval
- Synthetic data generation using distilabel (powers our new SFT dataset HuggingFaceTB/smoltalk)
- Post-training scripts with TRL & the alignment handbook
- On-device tools with llama.cpp for summarization, rewriting & agents

Apache 2.0 licensed. V2 pre-training data mix coming soon!

Which other tools should we add next?

posted an update 3 months ago

Post

1270

Amazing !!!! test Post