The ODIN and the policies trained by ODIN
Lichang Chen
Lichang-Chen
AI & ML interests
NLP and ML
Recent Activity
authored
a paper
9 days ago
Self-rewarding correction for mathematical reasoning
upvoted
a
paper
10 days ago
Self-rewarding correction for mathematical reasoning
updated
a model
12 days ago
Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3
Organizations
Collections
1
models
64

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3
Text Generation
•
Updated
•
8

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-2
Text Generation
•
Updated
•
4

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-1
Text Generation
•
Updated
•
17

Lichang-Chen/game-play-point25-50
Text Generation
•
Updated
•
11

Lichang-Chen/multi-attempts-multi-examples-Jan9
Text Generation
•
Updated
•
12

Lichang-Chen/multi-turn-Jan5
Text Generation
•
Updated
•
11

Lichang-Chen/multi-turn-Jan4
Text Generation
•
Updated
•
8

Lichang-Chen/llama3-dpo-single-turn-point2247-dec15
Text Generation
•
Updated
•
9

Lichang-Chen/llama-8b-gemini-point60-100-wo-cot
Text Generation
•
Updated
•
9

Lichang-Chen/llama-8b-gemini-point21-60-wo-cot
Text Generation
•
Updated
•
9
datasets
18
Lichang-Chen/omnixR-data
Viewer
•
Updated
•
1.4k
•
18
Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter3
Viewer
•
Updated
•
800
•
60
Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter2
Viewer
•
Updated
•
800
•
50
Lichang-Chen/llama_sft_dpo_bold_list_attack_iter1
Viewer
•
Updated
•
800
•
52
Lichang-Chen/dpo_it_attack_list_and_bold
Viewer
•
Updated
•
800
•
57
Lichang-Chen/llama3_it_dpo_attack_list_2epoch
Viewer
•
Updated
•
800
•
47
Lichang-Chen/llama3_it_dpo_attack_bold_2epoch
Viewer
•
Updated
•
800
•
68
Lichang-Chen/dpo_it_unbiased_ver3
Viewer
•
Updated
•
800
•
55
Lichang-Chen/list_training_pairs
Viewer
•
Updated
•
1k
•
46
Lichang-Chen/bold_training_pairs
Viewer
•
Updated
•
745
•
62