arxiv:2410.02725
Anikait Singh
Asap7772
AI & ML interests
Deep Learning, Reinforcement Learning, Robotics
Recent Activity
updated a dataset about 2 hours ago: Asap7772/Math-steptok-prm
updated a dataset about 4 hours ago: Asap7772/Math-steptok-mc-relabeled
updated a dataset about 7 hours ago: Asap7772/Math-steptok-mc
models 8
Asap7772/mathcamp_sft_llama3-1-8b • Text Generation • Updated • 7
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed15486-exp0_epoch0_checkpoint1 • Updated
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed31426-exp0_epoch0_checkpoint2 • Updated
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed31426-exp0_epoch0_checkpoint1 • Text Generation • Updated • 12
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed26382-exp0_epoch0_checkpoint2 • Updated
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed26382-exp0_epoch0_checkpoint1 • Text Generation • Updated • 14
Asap7772/elix-llama32-3b-ipo • Updated
Asap7772/sft-prm800k-llama31-8b-steptok • Text Generation • Updated • 2.35k
datasets 469
Asap7772/Math-steptok-prm • Viewer • Updated • 200k
Asap7772/Math-steptok-mc-relabeled • Viewer • Updated • 2.72M
Asap7772/Math-steptok-mc • Viewer • Updated • 2.72M • 71
Asap7772/Math-steptok-steps-mcvalue-test-part3-of-5 • Viewer • Updated • 22.1k
Asap7772/Math-steptok-steps-mcvalue-test-part5-of-5 • Viewer • Updated • 22.1k
Asap7772/Math-steptok-steps-mcvalue-test-part2-of-5 • Viewer • Updated • 22.1k
Asap7772/Math-steptok-steps-mcvalue-test-part4-of-5 • Viewer • Updated • 22.1k
Asap7772/Math-steptok-steps-mcvalue-test-part1-of-5 • Viewer • Updated • 22.1k
Asap7772/Math-steptok-steps-mcvalue-train-part4-of-5 • Viewer • Updated • 521k • 9
Asap7772/Math-steptok-steps-mcvalue-train-part2-of-5 • Viewer • Updated • 521k • 10