Loubna Ben Allal

loubnabnl

AI & ML interests

SmolLMs, ML for code, data

Organizations

Hugging Face, BigScience Workshop, BigScience Catalogue Data, BigScience Data, HuggingFaceBR4, Team 8, CodeParrot, BigCode, Hugging Face H4, CompVis Community, BigCode Data, LocalCodeLLMs, Need4Speed, Code Llama, Hugging Face TB Research, Hugging Face Smol Cluster, Nt3awnou, huggingPartyParis, Qwen, ZeroGPU Explorers, HF AFAIK, gg-hf, Nanotron Research, Women on Hugging Face, Hugging Face SMOL, HuggingFaceFW, bigcode nvidia, Social Post Explorers, Dev Mode Explorers, Cosmopedia Stories Collab, StarCoder2 Data, Data Agents, Argilla Warehouse, smol-explorers, swissai-hf-data, Hugging Face Science

loubnabnl's activity

reacted to ginipick's post with 🔥 4 days ago
🌟 Digital Odyssey: AI Image & Video Generation Platform 🎨
Welcome to our all-in-one AI platform for image and video generation! 🚀
✨ Key Features

🎨 High-quality image generation from text
🎥 Video creation from still images
🌍 Multi-language support with automatic translation
🛠️ Advanced customization options

💫 Unique Advantages

⚡ Fast and accurate results using FLUX.1-dev and Hyper-SD models
🔒 Robust content safety filtering system
🎯 Intuitive user interface
🛠️ Extended toolkit including image upscaling and logo generation

🎮 How to Use

Enter your image or video description
Adjust settings as needed
Click generate
Save and share your results automatically

🔧 Tech Stack

FluxPipeline
Gradio
PyTorch
OpenCV

link: ginigen/Dokdo

Turn your imagination into reality with AI! ✨
#AI #ImageGeneration #VideoGeneration #MachineLearning #CreativeTech
  • 7 replies
updated a Space 5 days ago
reacted to anton-l's post with 🚀🔥 6 days ago
Introducing 📐𝐅𝐢𝐧𝐞𝐌𝐚𝐭𝐡: the best public math pre-training dataset with 50B+ tokens!
HuggingFaceTB/finemath

Math remains challenging for LLMs, and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH.

We build the dataset by:
🛠️ carefully extracting math data from Common Crawl;
🔎 iteratively filtering and recalling high-quality math pages using a classifier trained on synthetic annotations to identify math reasoning and deduction.
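
To make the filter-and-recall idea concrete, here is a minimal sketch of one round. It is not the FineMath pipeline: `toy_score` is a hypothetical keyword counter standing in for the trained classifier, the 0 to 5 score scale and the threshold of 3 are assumptions, and recalling candidates by shared domain is just one plausible reading of "recalling". Page dictionaries with `url` and `text` keys are also illustrative.

```python
from urllib.parse import urlparse

# Hypothetical keyword hints; a real setup would use a trained classifier.
MATH_HINTS = ("theorem", "proof", "equation", "solve", "=")


def toy_score(text: str) -> int:
    """Stand-in scorer: counts how many hints appear in the text (0 to 5)."""
    lowered = text.lower()
    return sum(hint in lowered for hint in MATH_HINTS)


def filter_pages(pages, threshold):
    """Keep pages whose score is at least `threshold`."""
    return [p for p in pages if toy_score(p["text"]) >= threshold]


def recall_by_domain(kept, pool):
    """Assumed recall step: pull unseen pool pages from domains of kept pages."""
    domains = {urlparse(p["url"]).netloc for p in kept}
    seen = {p["url"] for p in kept}
    return [
        p for p in pool
        if urlparse(p["url"]).netloc in domains and p["url"] not in seen
    ]


def filter_and_recall(seed, pool, threshold=3):
    """One round: filter the seed, recall from the pool, filter the recalled pages."""
    kept = filter_pages(seed, threshold)
    recalled = recall_by_domain(kept, pool)
    return kept + filter_pages(recalled, threshold)
```

Repeating `filter_and_recall` with the previous output as the new seed would give an iterative variant; how many rounds to run and how to score pages are choices this sketch leaves open.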

We conducted a series of ablations comparing the performance of Llama-3.2-3B-Base after continued pre-training on FineMath and observed notable gains compared to the baseline model and other public math datasets.

We hope this helps advance the performance of LLMs on math and reasoning! 🚀
We’re also releasing all the ablation models as well as the evaluation code.

HuggingFaceTB/finemath-6763fb8f71b6439b653482c2