Commit 360547e by weqweasdas (1 parent: c20c9f0)
Update README.md
README.md
CHANGED
@@ -15,6 +15,7 @@ See the [collection](https://huggingface.co/collections/RLHFlow/online-rlhf-663a
 
 - [SFT model](https://huggingface.co/RLHFlow/LLaMA3-SFT)
 - [Reward model](https://huggingface.co/sfairXC/FsfairX-LLaMA3-RM-v0.1)
+- This model is closer to the concise version described in the report. We are still working on releasing the model because of a license issue.
 
 ## Dataset
 - [Preference data mix](https://huggingface.co/datasets/hendrydong/preference_700K)