monsoon-nlp
/

codellama-abliterated-2xd

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

monsoon-nlp commited on Jul 26

Commit

d65a7dd

•

1 Parent(s): 9cef505

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -13,7 +13,8 @@ CodeLlama-7b-Instruct-hf adapted using the abliteration notebook from [Maxime La
 Based on the paper ["Refusal in Language Models Is Mediated by a Single Direction"](https://arxiv.org/abs/2406.11717)
-**This version 2x-d the intervention vector**; see code model with less intervention: https://huggingface.co/monsoon-nlp/codellama-abliterated
 **Based on CodeLlama/Llama2 and subject to the restrictions of that model and license - not for unapproved uses**:

 Based on the paper ["Refusal in Language Models Is Mediated by a Single Direction"](https://arxiv.org/abs/2406.11717)
+**This version 2x-d the intervention vector**; in practice this repeats phrases or writes text instead of answering difficult questions.
+See the model with less intervention: https://huggingface.co/monsoon-nlp/codellama-abliterated
 **Based on CodeLlama/Llama2 and subject to the restrictions of that model and license - not for unapproved uses**: