hbseong committed on
Commit
f2e8429
1 Parent(s): 8f8c7e2

Update README.md

Files changed (1)
  1. README.md +11 -11
README.md CHANGED
@@ -1,17 +1,17 @@
- ---
- tags:
- - deberta-v3
- - deberta
- - deberta-v2
- license: mit
- base_model:
- - microsoft/deberta-v3-large
- pipeline_tag: text-classification
- ---
+ ---
+ tags:
+ - deberta-v3
+ - deberta
+ - deberta-v2
+ license: mit
+ base_model:
+ - microsoft/deberta-v3-large
+ pipeline_tag: text-classification
+ ---
 
  # HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
 
-
+ [Arxiv Link](https://arxiv.org/abs/2410.01524)
 
  Our model functions as a Guard Model, intended to classify the safety of conversations with LLMs and protect against LLM jailbreak attacks.
  It is fine-tuned from DeBERTa-v3-large and trained using **HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models**.