Wei Xiong

weqweasdas

AI & ML interests

Machine learning, RLHF

Recent Activity

Organizations

reward modeling's profile picture raft_study's profile picture Directional Preference Alignment's profile picture RLHFlow's profile picture RRLHF's profile picture TIRData's profile picture feedbackagent's profile picture selfrew's profile picture myselfrew's profile picture selfcorrexp's profile picture selfcorrexp2's profile picture mytestdpo's profile picture

weqweasdas's activity

New activity in RLHFlow/LLaMA3-SFT 4 months ago

LLaMA3.1-SFT

3
#3 opened 4 months ago by
jackzhang
New activity in Qwen/Qwen2.5-Math-RM-72B 4 months ago
New activity in RLHFlow/LLaMA3-SFT 5 months ago
New activity in RLHF4MATH/Gemma-7B-it-SFT3epoch 6 months ago

Update README.md

#1 opened 6 months ago by
weqweasdas
New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 6 months ago
New activity in weqweasdas/RM-Mistral-7B 8 months ago

why vocab size is 32001

1
#3 opened 8 months ago by
yechenzhi1
New activity in weqweasdas/RM-Mistral-7B 9 months ago

License

1
#2 opened 9 months ago by
ravir123
New activity in weqweasdas/RM-Mistral-7B 10 months ago

Fix dataset link

#1 opened 10 months ago by
ZennyKenny