RLHFlow
/

Llama3.1-8B-PRM-Deepseek-Data

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

weqweasdas commited on 13 days ago

Commit

f645c10

•

1 Parent(s): 1d12686

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -35,7 +35,7 @@ The model is trained from [meta-llama/Llama-3.1-8B-Instruct](https://huggingface
 ## Usage
-See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math for detailed examples.
 ## Citation

 ## Usage
+See https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/main/math-rm for detailed examples.
 ## Citation