Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
6
5
18
Hanze Dong
hendrydong
Follow
GigaBoy's profile picture
dvilasuero's profile picture
yifAI's profile picture
10 followers
·
7 following
https://hendrydong.github.io
hendrydong
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
upvoted
a
paper
3 days ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
new
activity
about 1 month ago
RLHFlow/LLaMA3.2-1B-SFT:
the training data for this model?
View all activity
Organizations
hendrydong
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
3 months ago
rhymes-ai/Aria
Image-Text-to-Text
•
Updated
8 days ago
•
17.9k
•
600
liked
a model
6 months ago
sfairXC/FsfairX-Gemma2-RM-v0.1
Text Classification
•
Updated
Jul 9
•
2.2k
•
5
liked
a dataset
7 months ago
Locutusque/function-calling-chatml
Viewer
•
Updated
Jul 16
•
113k
•
184
•
160
liked
a model
7 months ago
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
Updated
Oct 14
•
4.22k
•
38
liked
a dataset
7 months ago
weqweasdas/ultra_prompt_split
Viewer
•
Updated
Mar 20
•
60k
•
41
•
2
liked
6 models
8 months ago
Salesforce/LLaMA-3-8B-SFR-RM-R
Text Classification
•
Updated
May 31
•
5
•
11
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R
Text Generation
•
Updated
Jun 12
•
209
•
76
meta-llama/Meta-Llama-3-70B-Instruct
Text Generation
•
Updated
11 days ago
•
87k
•
1.45k
meta-llama/Meta-Llama-3-8B
Text Generation
•
Updated
Sep 27
•
567k
•
5.93k
sfairXC/FsfairX-Zephyr-Chat-v0.1
Text Generation
•
Updated
Apr 24
•
23
•
8
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
Oct 14
•
10.9k
•
52
liked
2 models
9 months ago
weqweasdas/RM-Mistral-7B
Text Classification
•
Updated
Mar 31
•
1.64k
•
22
hendrydong/Mistral-RM-for-RAFT-GSHF-v0
Text Classification
•
Updated
Mar 23
•
18
•
1
liked
2 models
10 months ago
weqweasdas/RM-Gemma-2B
Text Classification
•
Updated
Mar 22
•
1.22k
•
17
google/gemma-2b-it
Text Generation
•
Updated
Sep 27
•
85.6k
•
685
liked
a model
about 1 year ago
microsoft/phi-2
Text Generation
•
Updated
Apr 29
•
183k
•
•
3.26k
liked
a model
over 1 year ago
Salesforce/xgen-7b-8k-base
Text Generation
•
Updated
Feb 7
•
1.33k
•
318
liked
a Space
over 1 year ago
Runtime error
66
🔥
Robin 7b