RLHF + Code Vezora/Code-Preference-Pairs Viewer • Updated Jul 28, 2024 • 54k • 57 • 18 quangduc1112001/python-code-DPO-fine-tune Viewer • Updated Nov 4, 2024 • 2k • 82 • 2 xinlai/Math-Step-DPO-10K Viewer • Updated Jul 4, 2024 • 10.8k • 528 • 48 minfeng-ai/leetcode_preference Viewer • Updated Sep 6, 2023 • 457 • 17 • 6