Jian Hu

chuyi777

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

chuyi777's activity

upvoted a paper 2 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 5 days ago • 72

liked a model 20 days ago

CohereForAI/c4ai-command-r7b-12-2024

Text Generation • Updated 7 days ago • 11.5k • 351

upvoted a paper 28 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

commented on a paper 28 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

liked 2 datasets about 1 month ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 8.76k • 348

yingyingzhang/metamath-qwen2-math

Viewer • Updated Oct 1, 2024 • 467k • 284 • 30

upvoted a paper about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

updated 3 models 2 months ago

liked a model 3 months ago

O1-OPEN/OpenO1-LLama-8B-v0.1

Updated Oct 8, 2024 • 454 • 15

updated a model 3 months ago

OpenRLHF/Mistral-7b-PRM-Math-Shepherd

Updated Oct 30, 2024 • 5 • 1

New activity in OpenRLHF/Mistral-7b-PRM-Math-Shepherd 3 months ago

How do I download the model?

#1 opened 3 months ago by Yutong001

liked a model 3 months ago

AI-MO/NuminaMath-7B-TIR

Text Generation • Updated Aug 14, 2024 • 6.43k • 334

liked 2 models 4 months ago

Nexusflow/Athene-70B

Text Generation • Updated Nov 15, 2024 • 7.28k • 194

peiyi9979/mistral-7b-sft

Text Generation • Updated Jan 15, 2024 • 1.63k • 7

liked 2 datasets 4 months ago

nvidia/HelpSteer2

Viewer • Updated Dec 18, 2024 • 21.4k • 16.2k • 401

GAIR/o1-journey

Viewer • Updated Oct 16, 2024 • 327 • 346 • 132

liked 2 models 4 months ago

peiyi9979/math-shepherd-mistral-7b-rl

Text Generation • Updated Jan 15, 2024 • 251 • 5

peiyi9979/math-shepherd-mistral-7b-prm

Text Generation • Updated Jan 15, 2024 • 6.46k • 43