mike-conover-db committed
Commit 55215ac • 1 Parent(s): 5a67c3f
Updating README.
README.md
CHANGED
@@ -54,8 +54,8 @@ maximize the potential of all individuals and organizations.
 
 ### Benchmark Metrics
 
-Below you'll find various models benchmark performance on the [EleutherAI LLM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness)
-model results are sorted by geometric mean to produce an intelligible ordering.
+Below you'll find various models benchmark performance on the [EleutherAI LLM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness);
+model results are sorted by geometric mean to produce an intelligible ordering. As outlined above, these results demonstrate that `dolly-v2-12b` is not state of the art,
 and in fact underperforms `dolly-v1-6b` in some evaluation benchmarks. We believe this owes to the composition and size of the underlying fine tuning datasets,
 but a robust statement as to the sources of these variations requires further study.
 
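The changed README text says model results are sorted by geometric mean. A minimal sketch of that kind of ordering follows. The function names, the descending sort direction, and the scores are all assumptions made for illustration; the numbers are hypothetical placeholders, not results from the README or the evaluation harness.

```python
import math


def geometric_mean(scores):
    """Geometric mean of strictly positive scores, computed in log space."""
    if not scores or any(s <= 0 for s in scores):
        raise ValueError("scores must be non-empty and strictly positive")
    return math.exp(sum(math.log(s) for s in scores) / len(scores))


def rank_by_geometric_mean(results):
    """Return (model, geometric mean) pairs, highest geometric mean first.

    Sorting in descending order is an assumption of this sketch.
    """
    ranked = [
        (name, geometric_mean(list(task_scores.values())))
        for name, task_scores in results.items()
    ]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


# Hypothetical placeholder scores, NOT real benchmark numbers.
example_results = {
    "model_a": {"task_1": 0.50, "task_2": 0.50},
    "model_b": {"task_1": 0.90, "task_2": 0.20},
    "model_c": {"task_1": 0.40, "task_2": 0.40},
}

ranking = rank_by_geometric_mean(example_results)
for name, gmean in ranking:
    print(f"{name}: {gmean:.4f}")
```

In this toy data, `model_b` has the highest arithmetic mean but ranks below `model_a` by geometric mean, because the geometric mean penalizes a model that scores very low on any single task.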