78.0 TFLOPS
Gabriel Martín Blázquez
gabrielmbmb
99 followers · 57 following
https://gabrielmb.com
gabrielmbmb_
AI & ML interests
ML Engineer
Recent Activity
reacted to anton-l's post with 🚀 · 4 days ago
Introducing 📐𝐅𝐢𝐧𝐞𝐌𝐚𝐭𝐡: the best public math pre-training dataset with 50B+ tokens! https://huggingface.co./datasets/HuggingFaceTB/finemath

Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH.

We build the dataset by:
🛠️ carefully extracting math data from Common Crawl;
🔎 iteratively filtering and recalling high quality math pages using a classifier trained on synthetic annotations to identify math reasoning and deduction.

We conducted a series of ablations comparing the performance of Llama-3.2-3B-Base after continued pre-training on FineMath and observe notable gains compared to the baseline model and other public math datasets.

We hope this helps advance the performance of LLMs on math and reasoning! 🚀

We’re also releasing all the ablation models as well as the evaluation code. https://huggingface.co./collections/HuggingFaceTB/finemath-6763fb8f71b6439b653482c2
updated a dataset · 5 days ago · gabrielmbmb/gsm8k-reasoning-paths-combined
upvoted a paper · 5 days ago · Qwen2.5 Technical Report
Articles
How we leveraged distilabel to create an Argilla 2.0 Chatbot · Jul 16 · 32
gabrielmbmb's activity
New activity in argilla/magpie-ultra-v1.0 · 23 days ago
Question About Dataset Content · 1 comment · #2 opened 25 days ago by chrisliu298
New activity in allenai/tulu-3-sft-personas-math · 29 days ago
Add link to Tulu 3 paper · #2 opened 29 days ago by gabrielmbmb
New activity in argilla/ifeval-like-data · 2 months ago
Delete 'filtered_and_decontaminated' config · #2 opened 2 months ago by gabrielmbmb
New activity in gabrielmbmb/distilabel-reflection-tuning · 4 months ago
Are there any format requirements for system_prompt in TextGeneration? · 4 comments · #4 opened 4 months ago by Terrence-wpc
is it possible running in offline env? · 3 comments · #3 opened 4 months ago by xDAN2099
Possible to do something like this using together API? · 1 comment · #2 opened 4 months ago by nkasmanoff
New activity in argilla/magpie-ultra-v0.1 · 4 months ago
About response_base · 3 comments · #9 opened 4 months ago by flydust
New activity in argilla/mmlu-translation-progress · 4 months ago
Space not working · 2 comments · #1 opened 4 months ago by alkibijad
New activity in argilla/magpie-ultra-v0.1 · 5 months ago
Upload B 4.wav · #5 opened 5 months ago by YetNha
just wanted to say ty · 1 comment · #4 opened 5 months ago by skratos115
New activity in prometheus-eval/prometheus-7b-v2.0 · 5 months ago
Allow passing system prompt to chat template · #4 opened 5 months ago by gabrielmbmb
Tokenizer chat template doesn't accept system prompt · 1 comment · #3 opened 5 months ago by gabrielmbmb
New activity in gabrielmbmb/magpie-llama-3-70b-instruct · 5 months ago
Librarian Bot: Add language metadata for dataset · #1 opened 5 months ago by librarian-bot
New activity in argilla/magpie-ultra-v0.1 · 5 months ago
dataset topic diversity · 3 comments · #2 opened 5 months ago by pszemraj
New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 · 5 months ago
Update modeling_custom.py · #14 opened 5 months ago by gabrielmbmb
New activity in argilla/dpo-mix-7k · 8 months ago
Thank you · 1 comment · #5 opened 8 months ago by Yuma42
New activity in google-bert/bert-base-uncased · 8 months ago
Update LayerNorm tensor names to weight and bias (from gamma and beta) · 1 comment · #70 opened 8 months ago by gabrielmbmb
New activity in argilla/distilabeled-Marcoro14-7B-slerp · 10 months ago
Adding Evaluation Results · #3 opened 10 months ago by leaderboard-pr-bot
New activity in argilla/DistilabelBeagle14-7B · 10 months ago
Adding Evaluation Results · #1 opened 10 months ago by leaderboard-pr-bot
New activity in argilla/distilabeled-Marcoro14-7B-slerp-full · 10 months ago
Adding Evaluation Results · #1 opened 10 months ago by leaderboard-pr-bot