rl-llm-agent/Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter1

Adding `safetensors` variant of this model

#2 opened 6 days ago by

Adding `safetensors` variant of this model

#1 opened 11 days ago by