hunterhector
commited on
Commit
•
efd859b
1
Parent(s):
7c8f87a
Update README.md
Browse files
README.md
CHANGED
@@ -8,9 +8,10 @@ tags:
|
|
8 |
- nlp
|
9 |
- llm
|
10 |
---
|
11 |
-
# K2: a Fully Transparent OSS
|
12 |
|
13 |
-
LLM360 demystifies the
|
|
|
14 |
|
15 |
## Evaluations
|
16 |
<center><img src="eval_table_temp.png" alt="eval table"/></center>
|
|
|
8 |
- nlp
|
9 |
- llm
|
10 |
---
|
11 |
+
# K2: a Fully Transparent OSS Language at Llama 2 Performance Using 35% Less Compute
|
12 |
|
13 |
+
LLM360 demystifies the training recipe used for Llama 2 - 70B with K2. Reaching a comparable performance with Llama 2, K2 has 65B parameters
|
14 |
+
and is trained on around 1.4T tokens, resulting a receipe of approximately 35% less compute.
|
15 |
|
16 |
## Evaluations
|
17 |
<center><img src="eval_table_temp.png" alt="eval table"/></center>
|