weqweasdas
committed on
Commit f645c10
Parent(s): 1d12686
Update README.md
README.md
CHANGED
@@ -35,7 +35,7 @@ The model is trained from [meta-llama/Llama-3.1-8B-Instruct](https://huggingface
 
 ## Usage
 
-See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math for detailed examples.
+See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math-rm for detailed examples.
 
 ## Citation
 