---
license: apache-2.0
---
A model trained to accept or resist persuasion as appropriate, introduced by Stengel-Eskin et al. (2024): https://arxiv.org/abs/2410.14596