princeton-nlp
commited on
Commit
•
c580d90
1
Parent(s):
ebdb01f
Update README.md
Browse files
README.md
CHANGED
@@ -49,11 +49,11 @@ print(outputs[0]['generated_text'])
|
|
49 |
|
50 |
### Training Data
|
51 |
|
52 |
-
We use
|
53 |
|
54 |
#### Training Hyperparameters
|
55 |
|
56 |
-
|
57 |
|
58 |
#### Speeds, Sizes, Times
|
59 |
|
|
|
49 |
|
50 |
### Training Data
|
51 |
|
52 |
+
We use [princeton-nlp/gemma2-ultrafeedback-armorm](https://huggingface.co/datasets/princeton-nlp/gemma2-ultrafeedback-armorm) as the preference optimization dataset.
|
53 |
|
54 |
#### Training Hyperparameters
|
55 |
|
56 |
+
[TO BE FILLED LATER]
|
57 |
|
58 |
#### Speeds, Sizes, Times
|
59 |
|