Aviv-anthonnyolime's Collections
Dataset (updated Feb 2)
mlfoundations/MINT-1T-HTML • Viewer • Updated Sep 21, 2024 • 623M • 320k • 82
mlfoundations/MINT-1T-ArXiv • Viewer • Updated Sep 19, 2024 • 5.6M • 656 • 48
mlfoundations/MINT-1T-PDF-CC-2024-18 • Updated Sep 19, 2024 • 7.94k • 19
mlfoundations/dclm-baseline-1.0-parquet • Viewer • Updated Jul 19, 2024 • 2.73B • 20.2k • 26
HuggingFaceFW/fineweb-edu • Viewer • Updated Jan 31 • 3.3B • 499k • 648
HuggingFaceFW/fineweb • Viewer • Updated Jan 31 • 25B • 317k • 2.02k
jat-project/jat-dataset • Viewer • Updated Feb 16, 2024 • 258M • 489k • 37
HuggingFaceTB/finemath • Viewer • Updated Feb 6 • 48.3M • 11.5k • 292
DAMO-NLP-SG/multimodal_textbook • Updated Jan 11 • 5.27k • 132
fhswf/TinyStoriesV2_cleaned • Viewer • Updated May 23, 2024 • 2.71M • 891 • 10
TurkuNLP/finerweb-10bt • Viewer • Updated Jan 17 • 7.1M • 572 • 6