library_name: transformers | |
license: other | |
base_model: meta-llama/Llama-3.2-3B | |
tags: | |
- llama-factory | |
- full | |
- generated_from_trainer | |
model-index: | |
- name: GuardReasoner 3B | |
results: [] | |
# GuardReasoner 3B | |
This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co./meta-llama/Llama-3.2-3B) via R-SFT and HS-DPO. | |