ContinuousAT
/

Phi-CAPO

Model card Files Files and versions Community

SchwinnL commited on Jun 21

Commit

a6add25

•

1 Parent(s): 0019314

Update README.md

fixed CAT to CAPO

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ base_model: Phi-3-mini-4k-instruct
 # Model Card for Model ID
-In this repo are LoRa weights of the Phi-3-mini-4k-instruct model (https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) finetuned with the Continuous Adversarial Training (CAT) algorithm.
 For more information, see our paper "Efficient Adversarial Training in LLMs with Continuous Attacks" (https://arxiv.org/abs/2405.15589)
 ## Github

 # Model Card for Model ID
+In this repo are LoRa weights of the Phi-3-mini-4k-instruct model (https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) finetuned with the Continuous Adversarial Preference Optimisation (CAPO) algorithm.
 For more information, see our paper "Efficient Adversarial Training in LLMs with Continuous Attacks" (https://arxiv.org/abs/2405.15589)
 ## Github