Adding Evaluation Results (#2)
- Adding Evaluation Results (4ae90dfdf965323a1906bb9cd9a6f332405bbfe5)
Co-authored-by: Open LLM Leaderboard PR Bot <[email protected]>
README.md
CHANGED
@@ -1,10 +1,10 @@
 ---
-library_name: transformers
 license: llama3.2
-
+library_name: transformers
 tags:
 - axolotl
 - generated_from_trainer
+base_model: meta-llama/Llama-3.2-1B
 model-index:
 - name: Einstein-v8-Llama3.2-1B
   results: []
@@ -263,3 +263,17 @@ The following hyperparameters were used during training:
 - Pytorch 2.3.1+cu121
 - Datasets 2.21.0
 - Tokenizers 0.20.0
+
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Weyaxi__Einstein-v8-Llama3.2-1B)
+
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               | 4.63|
+|IFEval (0-Shot)    |18.62|
+|BBH (3-Shot)       | 3.01|
+|MATH Lvl 5 (4-Shot)| 0.00|
+|GPQA (0-shot)      | 1.12|
+|MuSR (0-shot)      | 3.22|
+|MMLU-PRO (5-shot)  | 1.79|
+
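The Avg. row in the added table is consistent with an unweighted mean of the six benchmark scores listed under it. The sketch below only checks that arithmetic. The `scores` dictionary and the `mean_score` helper are illustrative names, not part of the leaderboard tooling, and the values are copied by hand from the table.

```python
# Benchmark scores copied from the added table (values as reported there).
scores = {
    "IFEval (0-Shot)": 18.62,
    "BBH (3-Shot)": 3.01,
    "MATH Lvl 5 (4-Shot)": 0.00,
    "GPQA (0-shot)": 1.12,
    "MuSR (0-shot)": 3.22,
    "MMLU-PRO (5-shot)": 1.79,
}


def mean_score(values):
    """Unweighted arithmetic mean, rounded to two decimals like the table."""
    values = list(values)
    return round(sum(values) / len(values), 2)


# Assumption: the leaderboard average is a plain mean over these six rows.
average = mean_score(scores.values())
print(average)  # matches the Avg. row: 4.63
```

The six scores sum to 27.76, and 27.76 / 6 ≈ 4.627, which rounds to the reported 4.63.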