thu-coai
/

ShieldLM-13B-baichuan2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nonstopfor commited on Feb 27

Commit

d9d5698

•

1 Parent(s): 99e90a6

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ language:
 - zh
 ---
 ## Introduction
-The ShieldLM model ([paper link](xxx)) initialized from [Baichuan2-13B-Chat](https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat). ShieldLM is a bilingual (Chinese and English) safety detector that mainly aims to help to detect safety issues in LLMs' generations. It aligns with general human safety standards, supports fine-grained customizable detection rules, and provides explanations for its decisions.
 Refer to our [github repository](https://github.com/thu-coai/ShieldLM) for more detailed information.
 ## Usage
@@ -13,4 +13,4 @@ Please refer to our [github repository](https://github.com/thu-coai/ShieldLM) fo
 ## Performance
 ShieldLM demonstrates impressive detection performance across 4 ID and OOD test sets, compared to strong baselines such as GPT-4, Llama Guard and Perspective API.
-Refer to [our paper](xxx) for more detailed evaluation results.

 - zh
 ---
 ## Introduction
+The ShieldLM model ([paper link](https://arxiv.org/abs/2402.16444)) initialized from [Baichuan2-13B-Chat](https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat). ShieldLM is a bilingual (Chinese and English) safety detector that mainly aims to help to detect safety issues in LLMs' generations. It aligns with general human safety standards, supports fine-grained customizable detection rules, and provides explanations for its decisions.
 Refer to our [github repository](https://github.com/thu-coai/ShieldLM) for more detailed information.
 ## Usage
 ## Performance
 ShieldLM demonstrates impressive detection performance across 4 ID and OOD test sets, compared to strong baselines such as GPT-4, Llama Guard and Perspective API.
+Refer to [our paper](https://arxiv.org/abs/2402.16444) for more detailed evaluation results.