1 8 8

Lichang Chen

Lichang-Chen

https://lichang-chen.github.io/

AI & ML interests

NLP and ML

Recent Activity

authored a paper 9 days ago

Self-rewarding correction for mathematical reasoning

upvoted a paper 10 days ago

Self-rewarding correction for mathematical reasoning

updated a model 12 days ago

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3

View all activity

Organizations

Collections 1

models 64

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3

Text Generation • Updated 12 days ago • 8

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-2

Text Generation • Updated 12 days ago • 4

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-1

Text Generation • Updated 12 days ago • 17

Lichang-Chen/game-play-point25-50

Text Generation • Updated Jan 24 • 11

Lichang-Chen/multi-attempts-multi-examples-Jan9

Text Generation • Updated Jan 10 • 12

Lichang-Chen/multi-turn-Jan5

Text Generation • Updated Jan 5 • 11

Lichang-Chen/multi-turn-Jan4

Text Generation • Updated Jan 5 • 8

Lichang-Chen/llama3-dpo-single-turn-point2247-dec15

Text Generation • Updated Dec 17, 2024 • 9

Lichang-Chen/llama-8b-gemini-point60-100-wo-cot

Text Generation • Updated Nov 18, 2024 • 9

Lichang-Chen/llama-8b-gemini-point21-60-wo-cot

Text Generation • Updated Nov 18, 2024 • 9

datasets 18

Lichang-Chen/omnixR-data

Viewer • Updated Nov 26, 2024 • 1.4k • 18

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter3

Viewer • Updated Sep 18, 2024 • 800 • 60

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter2

Viewer • Updated Sep 18, 2024 • 800 • 50

Lichang-Chen/llama_sft_dpo_bold_list_attack_iter1

Viewer • Updated Sep 18, 2024 • 800 • 52

Lichang-Chen/dpo_it_attack_list_and_bold

Viewer • Updated Sep 18, 2024 • 800 • 57

Lichang-Chen/llama3_it_dpo_attack_list_2epoch

Viewer • Updated Sep 18, 2024 • 800 • 47

Lichang-Chen/llama3_it_dpo_attack_bold_2epoch

Viewer • Updated Sep 18, 2024 • 800 • 68

Lichang-Chen/dpo_it_unbiased_ver3

Viewer • Updated Sep 18, 2024 • 800 • 55

Lichang-Chen/list_training_pairs

Viewer • Updated Sep 18, 2024 • 1k • 46

Lichang-Chen/bold_training_pairs

Viewer • Updated Sep 18, 2024 • 745 • 62

Lichang Chen

AI & ML interests

Recent Activity

Organizations

Collections 1

Lichang-Chen/ODIN_L1_O1

Lichang-Chen/ODIN_L1

Lichang-Chen/ODIN-ReMax-L230-best

Lichang-Chen/ODIN-ReMax-L255-best

Papers 12

spaces 2

Reward Decomposition

DEFT

models 64

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-2

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-1

Lichang-Chen/game-play-point25-50

Lichang-Chen/multi-attempts-multi-examples-Jan9

Lichang-Chen/multi-turn-Jan5

Lichang-Chen/multi-turn-Jan4

Lichang-Chen/llama3-dpo-single-turn-point2247-dec15

Lichang-Chen/llama-8b-gemini-point60-100-wo-cot

Lichang-Chen/llama-8b-gemini-point21-60-wo-cot

datasets 18

Lichang-Chen/omnixR-data

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter3

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter2

Lichang-Chen/llama_sft_dpo_bold_list_attack_iter1

Lichang-Chen/dpo_it_attack_list_and_bold

Lichang-Chen/llama3_it_dpo_attack_list_2epoch

Lichang-Chen/llama3_it_dpo_attack_bold_2epoch

Lichang-Chen/dpo_it_unbiased_ver3

Lichang-Chen/list_training_pairs

Lichang-Chen/bold_training_pairs

Lichang Chen

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 12

spaces 2 Sort: Recently updated

Reward Decomposition

DEFT

models 64 Sort: Recently updated

datasets 18 Sort: Recently updated

spaces 2

models 64

datasets 18