---
library_name: transformers
license: other
base_model: meta-llama/Llama-3.2-3B
tags:
- llama-factory
- full
- generated_from_trainer
model-index:
- name: GuardReasoner 3B
  results: []
---
# GuardReasoner 3B
This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B), trained with R-SFT and HS-DPO.
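
Since the card lists `transformers` as the library, loading the checkpoint might look roughly like the sketch below. The repository id `yueliu1999/GuardReasoner-3B` is an assumption inferred from the repository name and is not stated in this card. The helper names `load_guardreasoner` and `generate` are hypothetical. The prompt template used during training is not given here, so the sketch takes raw text as input.

```python
"""Minimal loading sketch for GuardReasoner 3B (repo id is an assumption)."""

# Assumed Hub repository id; not stated in this card, adjust if it differs.
MODEL_ID = "yueliu1999/GuardReasoner-3B"


def load_guardreasoner(model_id: str = MODEL_ID):
    """Return (tokenizer, model) loaded through the transformers Auto classes."""
    # Imported lazily so the module can be imported without the heavy dependency.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
    return tokenizer, model


def generate(tokenizer, model, text: str, max_new_tokens: int = 256) -> str:
    """Greedy-decode a continuation of `text` and return only the new tokens."""
    inputs = tokenizer(text, return_tensors="pt").to(model.device)
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    # Drop the prompt tokens so only the model's continuation is decoded.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `tok, lm = load_guardreasoner()` and then `generate(tok, lm, text)` downloads the weights on first use. Greedy decoding is chosen here only to keep the output deterministic, not because the card recommends it.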