RL LLM Coders
university
https://github.com/portal-cornell/rl_llm_coders
AI & ML interests
None defined yet.
Recent Activity
chalo2000 authored a paper 7 days ago: Multi-Turn Code Generation Through Single-Step Rewards
chalo2000 authored a paper 16 days ago: Robotouille: An Asynchronous Planning Benchmark for LLM Agents
arnavkj1995 updated a model about 1 month ago: rl-llm-coders/RS_GT_RM_8B_iter2
Team members
4
models
34
rl-llm-coders/RS_GT_RM_8B_iter2 • Text Classification • Updated Jan 30 • 216
rl-llm-coders/Multi_SFT_8B • Text Generation • Updated Jan 30 • 347
rl-llm-coders/RS_SFT_8B_iter2 • Text Generation • Updated Jan 30 • 457
rl-llm-coders/RS_SFT_8B_iter1 • Text Generation • Updated Jan 30 • 16
rl-llm-coders/RS_GT_SFT_8B_iter2 • Text Generation • Updated Jan 30 • 307
rl-llm-coders/RS_GT_SFT_8B_iter1 • Text Generation • Updated Jan 30 • 15
rl-llm-coders/RS_GT_RM_8B_iter1 • Text Classification • Updated Jan 30 • 14
rl-llm-coders/RS_GT_RM_8B • Text Classification • Updated Jan 30 • 11
rl-llm-coders/Final_RS_RM_8B_iter2 • Text Classification • Updated Jan 30 • 206
rl-llm-coders/Final_RS_RM_8B_iter1 • Text Classification • Updated Jan 30 • 10
datasets
None public yet