rl-llm-agent
/

Llama-3.2-3B-Instruct-online-dpo-alfworld-iter0

Model card Files Files and versions Community

Llama-3.2-3B-Instruct-online-dpo-alfworld-iter0 / pytorch_model.bin.index.json

Commit History

upload checkpoint

da3f523
verified

sc2582 commited on 22 days ago