Merge branch 'main' of hf.co:yuzc19/pythia-410m-mates
README.md
CHANGED
@@ -6,7 +6,7 @@ datasets:
 
 *Pythia-410M models pre-trained by MATES.*
 
-The
+The training step is the iteration divided by 4, i.e., iter-040000-ckpt.pth corresponds to the model checkpoint in step 10000.
 
 Paper: [MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models](https://arxiv.org/pdf/2406.06046)
 
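
The iteration-to-step rule added to the README can be sketched as a small helper. This is a minimal illustration, not code from the repository: the function names `checkpoint_to_step` and `step_to_checkpoint` are hypothetical, and the filename pattern (six zero-padded digits between `iter-` and `-ckpt.pth`) is an assumption inferred from the single example `iter-040000-ckpt.pth`. Only the divisor of 4 comes from the README itself.

```python
import re

# From the README: training step = iteration / 4.
ITERS_PER_STEP = 4

# Assumed filename pattern, inferred from the one example "iter-040000-ckpt.pth".
_CKPT_RE = re.compile(r"iter-(\d+)-ckpt\.pth$")


def checkpoint_to_step(filename: str) -> int:
    """Return the training step for a checkpoint filename (hypothetical helper)."""
    match = _CKPT_RE.search(filename)
    if match is None:
        raise ValueError(f"unrecognized checkpoint name: {filename!r}")
    iteration = int(match.group(1))
    # Assumption: every saved iteration is a multiple of 4; fail loudly otherwise.
    if iteration % ITERS_PER_STEP != 0:
        raise ValueError(f"iteration {iteration} is not a multiple of {ITERS_PER_STEP}")
    return iteration // ITERS_PER_STEP


def step_to_checkpoint(step: int) -> str:
    """Return the checkpoint filename for a training step (assumes 6-digit padding)."""
    return f"iter-{step * ITERS_PER_STEP:06d}-ckpt.pth"


if __name__ == "__main__":
    print(checkpoint_to_step("iter-040000-ckpt.pth"))
    print(step_to_checkpoint(10000))
```

Under these assumptions, `checkpoint_to_step("iter-040000-ckpt.pth")` gives 10000, matching the example in the README line.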