Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Organizations
models
33

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt
Updated
•
249

hamishivi/tulu-2-wildchat-326k-sft
Updated
•
10

hamishivi/tulu-2-arena-hard-326k-sft
Updated
•
19

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft
Updated
•
19

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft
Updated
•
22

hamishivi/tulu-2-multitask-rrmax-326k-sft
Updated
•
37

hamishivi/qwen2_math_tokenizer_tweaked
Updated

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350
Updated
•
122

hamishivi/0224_jupiter_hamish_grpo_s1k_only_orz_24021
Updated
•
267

hamishivi/0224_jupiter_hamish_grpo_tulu3_only_orz_1432
Updated
•
175
datasets
38
hamishivi/SimpleQA-RLVR
Viewer
•
Updated
•
4.33k
•
71
hamishivi/lsds_data
Preview
•
Updated
•
104
hamishivi/rds-sels-tydiqa-shots-top326k
Viewer
•
Updated
•
326k
•
33
hamishivi/rds-sels-squad-top326k
Viewer
•
Updated
•
326k
•
32
hamishivi/rds-sels-mmlu-shots-top326k
Viewer
•
Updated
•
326k
•
24
hamishivi/rds-sels-bbh-shots-top326k
Viewer
•
Updated
•
326k
•
28
hamishivi/rds-sels-codex-top326k
Viewer
•
Updated
•
326k
•
28
hamishivi/rds-sels-gsm8k-shots-top326k
Viewer
•
Updated
•
326k
•
99
hamishivi/rds-sels-alpacafarm-top326k
Viewer
•
Updated
•
326k
•
36
hamishivi/rds-sels-wildchat-top326k
Viewer
•
Updated
•
326k
•
27