arxiv:2501.04682
Violet Xiang
violetxi
·
AI & ML interests
None yet
Organizations
Papers
2
models
386
violetxi/sft_tir_3e-5_b32_warmup0.1_checkpoint-epoch2
Text Generation
•
Updated
•
65k
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_wd0.01_epoch0
Text Generation
•
Updated
•
1
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_wd0.01_checkpoint12000
Text Generation
•
Updated
•
1
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_wd0.01_checkpoint6000
Text Generation
•
Updated
•
1
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_epoch0
Text Generation
•
Updated
•
3
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint12000
Text Generation
•
Updated
•
2
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint11400
Text Generation
•
Updated
•
3
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint10800
Text Generation
•
Updated
•
4
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint10200
Text Generation
•
Updated
•
2
violetxi/ak-prm-subfull_base_lr1e-5_wa0.03_wd0.01_checkpoint1260
Updated
datasets
263
violetxi/MATH-500_L5_best_first_N128_B32_D15_T0.0001_0-134
Viewer
•
Updated
•
42
•
44
violetxi/MATH-500_L3_best_first_N128_B16_D15_T0.0001_0-21
Viewer
•
Updated
•
21
•
11
violetxi/MATH-500_L5_best_first_N128_B16_D15_T0.0001_0-134
Viewer
•
Updated
•
18
•
37
violetxi/MATH-500_L4_best_first_N128_B8_D15_T0.0001_86-100
Viewer
•
Updated
•
14
•
34
violetxi/MATH-500_L2_best_first_N128_B16_D15_T0.0001_0-90
Viewer
•
Updated
•
90
•
33
violetxi/MATH-500_L4_best_first_N128_B16_D15_T0.0001_0-128
Viewer
•
Updated
•
21
•
36
violetxi/MATH-500_L1_best_first_N128_B16_D15_T0.0001_0-43
Viewer
•
Updated
•
43
•
33
violetxi/MATH-500_L3_best_first_N128_B8_D15_T0.0001_0-75
Viewer
•
Updated
•
75
•
36
violetxi/MATH-500_L4_best_first_N128_B8_D15_T0.0001_62-80
Viewer
•
Updated
•
18
•
40
violetxi/MATH-500_L4_best_first_N128_B8_D15_T0.0001_80-86
Viewer
•
Updated
•
6
•
31