siddartha-abacus committed 8233a1a (parent 0770313): Update README.md
These are our Needle-in-a-Haystack heatmap results. We are conducting further ev

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64c14f6b02e1f8f67c73bd05/Z4uUhcjgf1P7EPGQyRLkW.png)

### MT-Bench Evaluation

We also measured performance on MT-Bench to verify that the context extension did not significantly impact performance on instruct tasks:

```
####### 1st turn:
Meta-Llama-3-70B-Instruct 9.21
Llama-3-Giraffe-70B-Instruct 9.19

####### 2nd turn:
Meta-Llama-3-70B-Instruct 2 8.80
Llama-3-Giraffe-70B-Instruct 2 8.54

####### average:
Meta-Llama-3-70B-Instruct 9.00
Llama-3-Giraffe-70B-Instruct 8.87
```
## Training Methodology
The methodology for training uses [PoSE](https://arxiv.org/abs/2309.10400) and dynamic-NTK interpolation.
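This card does not publish the exact training configuration, but both ideas are compact enough to sketch. Below is an illustrative implementation, not the model's actual code: a simplified two-chunk variant of PoSE position-id sampling, and the dynamic-NTK base rescaling formula used in common open-source RoPE-scaling implementations. All parameter values in the example are assumptions.

```python
import random


def pose_position_ids(train_len: int, target_len: int, rng=random):
    """Simplified two-chunk PoSE sketch: train on only `train_len` tokens
    while sampling position ids that reach into the longer `target_len`
    window. (The paper samples chunks and skips more generally.)"""
    split = rng.randrange(1, train_len)                  # chunk boundary
    skip = rng.randrange(0, target_len - train_len + 1)  # shift for chunk 2
    # Chunk 1 keeps contiguous positions; chunk 2 is shifted by the skip,
    # so the model sees long-range relative distances at short-context cost.
    return list(range(split)) + list(range(split + skip, train_len + skip))


def dynamic_ntk_base(base: float, dim: int, seq_len: int,
                     max_trained: int, scale: float) -> float:
    """Dynamic-NTK interpolation sketch: grow the RoPE base as the input
    exceeds the trained context, preserving high-frequency components."""
    if seq_len <= max_trained:
        return base  # within the trained context: no rescaling needed
    adjusted = scale * seq_len / max_trained - (scale - 1)
    return base * adjusted ** (dim / (dim - 2))
```

With a hypothetical trained window of 8,192 and a target of 65,536, `pose_position_ids(8192, 65536)` returns 8,192 strictly increasing ids bounded by the target window; `dynamic_ntk_base` leaves the base untouched until the sequence outgrows the trained context.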