RuiyangSun
committed on
Commit 32e35c1
1 Parent(s): 588a9a4
docs: update readme
README.md
CHANGED
@@ -20,7 +20,7 @@ library_name: safe-rlhf
 
 ## Model Details
 
-The Beaver Cost model is a preference model trained using the [PKU-SafeRLHF](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF dataset
+The Beaver Cost model is a preference model trained using the [PKU-SafeRLHF](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF) dataset.
 It can play a role in the safe RLHF algorithm, helping the Beaver model become more safe and harmless.
 
 - **Developed by:** the [PKU-Alignment](https://github.com/PKU-Alignment) Team.