rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter1
PyTorch
llama
#2 Adding `safetensors` variant of this model (opened 6 days ago by SFconvertbot)
#1 Adding `safetensors` variant of this model (opened 11 days ago by SFconvertbot)