Update README.md
Browse files
README.md
CHANGED
@@ -118,5 +118,12 @@ Refer to our [Github repo](https://github.com/SORRY-Bench/SORRY-Bench) for more
|
|
118 |
## Citation
|
119 |
|
120 |
```
|
121 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
122 |
```
|
|
|
118 |
## Citation
|
119 |
|
120 |
```
|
121 |
+
@misc{xie2024sorrybench,
|
122 |
+
title={SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors},
|
123 |
+
author={Tinghao Xie and Xiangyu Qi and Yi Zeng and Yangsibo Huang and Udari Madhushani Sehwag and Kaixuan Huang and Luxi He and Boyi Wei and Dacheng Li and Ying Sheng and Ruoxi Jia and Bo Li and Kai Li and Danqi Chen and Peter Henderson and Prateek Mittal},
|
124 |
+
year={2024},
|
125 |
+
eprint={2406.14598},
|
126 |
+
archivePrefix={arXiv},
|
127 |
+
primaryClass={cs.AI}
|
128 |
+
}
|
129 |
```
|