Improve model card

#3
by nielsr HF staff - opened
Files changed (1) hide show
  1. README.md +3 -4
README.md CHANGED
@@ -9,13 +9,13 @@ tags:
9
  model-index:
10
  - name: GuardReasoner 3B
11
  results: []
12
- pipeline_tag: text-generation
13
  ---
14
 
15
  # GuardReasoner 3B
16
 
17
  This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) via R-SFT and HS-DPO, as described in [GuardReasoner: Towards Reasoning-based LLM Safeguards](https://huggingface.co/papers/2501.18492).
18
-
19
 
20
  The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain).
21
 
@@ -80,5 +80,4 @@ print(output)
80
  journal={arXiv preprint arXiv:2501.18492},
81
  year={2025}
82
  }
83
- ```
84
-
 
9
  model-index:
10
  - name: GuardReasoner 3B
11
  results: []
12
+ pipeline_tag: text-classification
13
  ---
14
 
15
  # GuardReasoner 3B
16
 
17
  This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) via R-SFT and HS-DPO, as described in [GuardReasoner: Towards Reasoning-based LLM Safeguards](https://huggingface.co/papers/2501.18492).
18
+ Code: https://github.com/yueliu1999/GuardReasoner/.
19
 
20
  The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain).
21
 
 
80
  journal={arXiv preprint arXiv:2501.18492},
81
  year={2025}
82
  }
83
+ ```