mtasic85 committed
Commit 98c1e4f
Parent(s): 05684bf

pretrain eval

Files changed (1): README.md (+7 -9)
README.md CHANGED

@@ -44,15 +44,17 @@ This model **isn't** designed for immediate use but rather for Continued Pretrai
 
 The objective is to streamline the cognitive or reasoning core, eliminating any redundant knowledge from the model.
 
-[loss, val_loss]()
+[loss, val_loss](https://api.wandb.ai/links/mtasic85/strnx9rl)
 
-[val_ppl]()
+[val_ppl](https://api.wandb.ai/links/mtasic85/ljwxf4am)
 
-[epoch]()
+[epoch](https://api.wandb.ai/links/mtasic85/edyph869)
 
-[learning_rate]()
+[learning_rate](https://api.wandb.ai/links/mtasic85/eswxyger)
 
-## lm-evaluation-harness
+## Pretrain Evaluation
+
+### lm-evaluation-harness
 
 ```bash
 litgpt evaluate --tasks 'hellaswag,gsm8k,truthfulqa_mc2,mmlu,winogrande,arc_challenge' --out_dir 'evaluate-quick/' --batch_size 4 --dtype 'bfloat16' out/pretrain/final/

@@ -256,10 +258,6 @@ litgpt evaluate --tasks 'arc_challenge,boolq,gpqa,hellaswag,openbookqa,piqa,trut
 |truthfulqa_mc2 | 2|none | 0|acc |↑ |0.5061|± |0.0167|
 |winogrande    | 1|none | 0|acc |↑ |0.4933|± |0.0141|
 
-```bash
-litgpt evaluate --tasks 'mmlu_multilingual,mgsm' --out_dir 'evaluate-multilinguals/' --batch_size 4 --dtype 'bfloat16' out/pretrain/final/
-```
-
 ```bash
 litgpt evaluate --tasks 'wikitext,qasper' --out_dir 'evaluate-long/' --batch_size 4 --dtype 'bfloat16' out/pretrain/final/
 ```
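The result rows kept as context in the second hunk use lm-evaluation-harness's markdown table format (task, version, filter, n-shot, metric, direction, value, ±, stderr). A minimal sketch of extracting the numbers from such rows, assuming that column layout from the two rows shown; `parse_row` is a hypothetical helper, not part of litgpt or lm-evaluation-harness:

```python
# Parse lm-evaluation-harness markdown result rows like those in this diff.
# Column order is assumed from the two rows shown in the README.
rows = [
    "|truthfulqa_mc2 | 2|none | 0|acc |↑ |0.5061|± |0.0167|",
    "|winogrande    | 1|none | 0|acc |↑ |0.4933|± |0.0141|",
]

def parse_row(row: str) -> dict:
    # Strip the outer pipes, then split on the inner ones.
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    task, version, flt, shots, metric, _direction, value, _pm, stderr = cells
    return {
        "task": task,
        "metric": metric,
        "value": float(value),
        "stderr": float(stderr),
    }

results = [parse_row(r) for r in rows]
```

This keeps the scores machine-readable, e.g. for comparing runs before and after continued pretraining.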