High-quality pretraining and instruction datasets for law, mathematics, and science.
Casey
casey-martin
AI & ML interests
Biomedical Tool Usage
Graph Learning
Ecophysiology
Recent Activity
liked a dataset 1 day ago: microsoft/EpiCoder-func-380k
liked a dataset 6 days ago: simplescaling/s1K-claude-3-7-sonnet
liked a dataset 6 days ago: KodCode/KodCode-V1-SFT-R1
Organizations
Collections: 1
Models: none public yet
Datasets: 10
casey-martin/Seal-Tools • Viewer • Updated • 14.1k • 95
casey-martin/GeneGPT • Preview • Updated • 76
casey-martin/math_notebooks • Viewer • Updated • 18.1k • 83
casey-martin/CommonLit-Ease-of-Readability • Viewer • Updated • 4.72k • 86 • 1
casey-martin/multilingual-mathematical-autoformalization • Viewer • Updated • 666k • 311 • 2
casey-martin/MedInstruct • Preview • Updated • 59 • 7
casey-martin/qald_9_plus • Viewer • Updated • 15.8k • 231 • 1
casey-martin/vquanda • Viewer • Updated • 5k • 81 • 3
casey-martin/protocols_io • Updated • 38
casey-martin/oa_cpp_annotate_gen • Viewer • Updated • 104k • 83 • 2