OREAL - a internlm Collection

internlm 's Collections

InternLM-XComposer2.5

OREAL

InternLM2-Reward

InternLM-XComposer2

OREAL

updated 27 days ago

internlm/OREAL-32B

Text Generation • Updated 14 days ago • 1.43k • 21
internlm/OREAL-7B

Text Generation • Updated 14 days ago • 570 • 19
internlm/OREAL-DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 14 days ago • 1.42k • 8
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 27 days ago • 60
internlm/OREAL-32B-SFT

Question Answering • Updated 14 days ago • 1.86k • 5
internlm/OREAL-7B-SFT

Text Generation • Updated 14 days ago • 165 • 1
internlm/OREAL-RL-Prompts

Viewer • Updated 21 days ago • 4.21k • 333 • 9