Ferdinand Mom

3outeille

AI & ML interests

None yet

Organizations

Hugging Face, HuggingFaceBR4, Hugging Face TB Research, huggingPartyParis, Nanotron Research, HuggingFaceFW

3outeille's activity

reacted to loubnabnl's post with ❤️🤗🤯 10 months ago
⭐ Today we're releasing The Stack v2 & StarCoder2: a series of 3B, 7B & 15B code generation models trained on 3.3 to 4.5 trillion tokens of code:

- StarCoder2-15B matches or outperforms CodeLlama 34B, and approaches DeepSeek-33B on multiple benchmarks.
- StarCoder2-3B outperforms StarCoderBase-15B and similarly sized models.
- The Stack v2 is a 4x larger dataset than The Stack v1, resulting in 900B unique code tokens 🚀
As always, we released everything from models and datasets to curation code. Enjoy!

🔗 StarCoder2 collection: bigcode/starcoder2-65de6da6e87db3383572be1a
🔗 Paper: https://drive.google.com/file/d/17iGn3c-sYNiLyRSY-A85QOzgzGnGiVI3/view
🔗 BlogPost: https://huggingface.co./blog/starcoder2
🔗 Code Leaderboard: bigcode/bigcode-models-leaderboard