kalomaze commited on
Commit
e7b159b
1 Parent(s): e7766e1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -1,3 +1,10 @@
 
 
 
 
 
 
 
1
  # 70b Distillation Experiment
2
  This is not the full-fledged run that I plan to do for a large scale distillation of Llama3 70b.
3
  Instead, it's a preliminary test train of the custom distillation trainer, where we target KL divergence from the larger Llama3 70b teacher model onto 4x8b (the student).
 
1
+ ---
2
+ license: llama3
3
+ language:
4
+ - en
5
+ tags:
6
+ - code
7
+ ---
8
  # 70b Distillation Experiment
9
  This is not the full-fledged run that I plan to do for a large scale distillation of Llama3 70b.
10
  Instead, it's a preliminary test train of the custom distillation trainer, where we target KL divergence from the larger Llama3 70b teacher model onto 4x8b (the student).