arxiv:2501.04682
Anikait Singh
Asap7772
AI & ML interests
Deep Learning, Reinforcement Learning, Robotics
Recent Activity
updated
a dataset
1 day ago
Asap7772/persona-iterative-responses-per50
updated
a dataset
1 day ago
Asap7772/persona-iterative-responses-per20
updated
a dataset
1 day ago
Asap7772/persona-iterative-responses-per10
Organizations
models
18
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_epoch0
Text Generation
•
Updated
•
10
Asap7772/prm_datamath-mc-full_objbce_lr1e-07_epoch0
Text Generation
•
Updated
•
3
Asap7772/prm_datamath-mc-full_objbce_lr1e-06_epoch0
Text Generation
•
Updated
•
8
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_epoch0
Text Generation
•
Updated
•
9
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_epoch0
Text Generation
•
Updated
•
9
Asap7772/prm_datamath-mc-full_objbce_lr5e-07_epoch0
Text Generation
•
Updated
•
2
Asap7772/prm_datamath-mc-full_objbce_lr0.0005_epoch0
Text Generation
•
Updated
•
6
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_checkpoint2400
Updated
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_checkpoint2400
Updated
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_checkpoint2400
Updated
datasets
857
Asap7772/persona-iterative-responses-per50
Viewer
•
Updated
•
210k
•
2
Asap7772/persona-iterative-responses-per20
Viewer
•
Updated
•
83.8k
•
3
Asap7772/persona-iterative-responses-per10
Viewer
•
Updated
•
41.8k
•
2
Asap7772/persona-iterative-responses-per5
Viewer
•
Updated
•
20.5k
•
2
Asap7772/elix_multexpert_preferences_gpt4o_pref
Viewer
•
Updated
•
533k
•
5
Asap7772/elix_multexpert_preferences_gpt-4o_pref_test
Viewer
•
Updated
•
267k
•
2
Asap7772/elix_multexpert_preferences_gpt-4o_pref_train
Viewer
•
Updated
•
267k
•
2
Asap7772/elix_multexpert_preferences
Viewer
•
Updated
•
533k
•
5
Asap7772/elix_multexpert_generations_flat
Viewer
•
Updated
•
10.7M
•
12
Asap7772/elix_multexpert_generations_llama32_3b_fixed
Viewer
•
Updated
•
60.7k
•
3