Update README.md
Browse files
README.md
CHANGED
@@ -32,7 +32,7 @@ We evaluate the model on [RewardBench](https://github.com/allenai/reward-bench):
|
|
32 |
| Model | Score | Chat | Chat Hard | Safety | Reasoning |
|
33 |
|------------------|-------|-------|-----------|--------|-----------|
|
34 |
| [Llama 3 Tulu 2 8b UF RM](https://huggingface.co/allenai/llama-3-tulu-2-8b-uf-mean-rm) | 73.6 | 95.3 | 59.2 | 57.9 | 82.1 |
|
35 |
-
| **[Llama 3 Tulu 2 70b UF RM](https://huggingface.co/allenai/llama-3-tulu-2-70b-uf-mean-rm) (this model)** |
|
36 |
|
37 |
|
38 |
|
|
|
32 |
| Model | Score | Chat | Chat Hard | Safety | Reasoning |
|
33 |
|------------------|-------|-------|-----------|--------|-----------|
|
34 |
| [Llama 3 Tulu 2 8b UF RM](https://huggingface.co/allenai/llama-3-tulu-2-8b-uf-mean-rm) | 73.6 | 95.3 | 59.2 | 57.9 | 82.1 |
|
35 |
+
| **[Llama 3 Tulu 2 70b UF RM](https://huggingface.co/allenai/llama-3-tulu-2-70b-uf-mean-rm) (this model)** | 71.0 | 86.3 | 56.1 | 58.9 | 82.7 |
|
36 |
|
37 |
|
38 |
|