Hugging Face
Alex Havrilla
Dahoas
64 followers · 0 following
https://dahoas.github.io/
dahoas
AI & ML interests
NLP, RL
Recent Activity
Updated a dataset 3 days ago: Dahoas/numina-synthetic
Updated a dataset 15 days ago: Dahoas/aimo-validation-aime
Upvoted a paper 19 days ago: Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models
Articles
Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022 • 119
Organizations
Papers (3)
arxiv:2412.02980
arxiv:2403.04642
arxiv:2402.10963
Models (33)
Dahoas/gptj-rm-IHP • Updated Mar 8, 2023 • 2
Dahoas/gptneox-response-full-static-sft • Text Generation • Updated Mar 6, 2023 • 13 • 1
Dahoas/pythia-1B-response-full-static-sft • Text Generation • Updated Mar 6, 2023 • 17 • 1
Dahoas/pythia-125M-response-full-static-sft • Text Generation • Updated Mar 6, 2023 • 13 • 1
Dahoas/synthetic-pythia-6B-rm-sft-response • Text Generation • Updated Mar 2, 2023 • 17
Dahoas/pythia-6B-sft-response-full-static • Text Generation • Updated Feb 27, 2023 • 18 • 1
Dahoas/gptj-6B-response-full-static-sft • Text Generation • Updated Feb 15, 2023 • 14 • 1
Dahoas/pythia-6B-rm-response-full-hh • Updated Feb 15, 2023
Dahoas/gptj-response-full-sft • Text Generation • Updated Feb 15, 2023 • 12 • 1
Dahoas/pythia-6b-rm-response-only-full-hh • Text Generation • Updated Feb 14, 2023 • 14
Datasets (147)
Dahoas/numina-synthetic • Updated 3 days ago • 361k • 118
Dahoas/aimo-validation-aime • Updated 15 days ago • 90 • 16
Dahoas/qwen-1.5-4B-default-positives-epoch-1-100 • Updated 20 days ago • 290k • 44
Dahoas/qwen-1.5-4B-tree-positives-epoch-2-100 • Updated 20 days ago • 491k • 42
Dahoas/qwen-1.5-4B-tree-positives-epoch-1-100 • Updated 20 days ago • 477k • 47
Dahoas/qwen-1.5-4B-epoch-1-test-100 • Updated 27 days ago • 498k • 63
Dahoas/qwen-1.5-4B-K-100-test • Updated Nov 5 • 500k • 31
Dahoas/MATH_train_K_100_qwen_1.5_4B_outputs • Updated Oct 22 • 750k • 36
Dahoas/MATH-K-100-train • Updated Sep 12 • 750k • 1.27k • 2
Dahoas/gsm8k_reformatted • Updated Aug 13 • 8.79k • 37