Hanze Dong
hendrydong
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
upvoted
a
paper
3 days ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
new activity
about 1 month ago
RLHFlow/LLaMA3.2-1B-SFT:the training data for this model?
Organizations
hendrydong's activity
the training data for this model?
1
#1 opened about 1 month ago
by
AIR-hl
Update README.md
#4 opened 8 months ago
by
johnowhitaker
Update README.md
#6 opened 2 months ago
by
Haoxiang-Wang
Training details?
1
#2 opened 8 months ago
by
MicPie
How to Train model with AutoModelForSequenceClassification?
4
#20 opened about 1 year ago
by
jerife
Unrecognized model in openbmb/cpm-bee-10b. Should have a `model_type` key in its config.json
4
#1 opened over 1 year ago
by
hendrydong