Update README.md
README.md
CHANGED
@@ -8,9 +8,10 @@ tags:
 - nlp
 - llm
 ---
-# K2: a Fully Transparent OSS
+# K2: a Fully Transparent OSS Language Model at Llama 2 Performance Using 35% Less Compute
 
-LLM360 demystifies the
+LLM360 demystifies the training recipe used for Llama 2 - 70B with K2. Reaching performance comparable to Llama 2, K2 has 65B parameters
+and is trained on around 1.4T tokens, resulting in a recipe that uses approximately 35% less compute.
 
 ## Evaluations
 <center><img src="eval_table_temp.png" alt="eval table"/></center>