weqweasdas committed on
Commit
360547e
1 Parent(s): c20c9f0

Update README.md

  1. README.md +1 -0
README.md CHANGED
@@ -15,6 +15,7 @@ See the [collection](https://huggingface.co/collections/RLHFlow/online-rlhf-663a
 
  - [SFT model](https://huggingface.co/RLHFlow/LLaMA3-SFT)
  - [Reward model](https://huggingface.co/sfairXC/FsfairX-LLaMA3-RM-v0.1)
+ - This model is closer to the concise version described in the report. We are still working on releasing the model because of a license issue.
 
  ## Dataset
  - [Preference data mix](https://huggingface.co/datasets/hendrydong/preference_700K)