HyperionHF
commited on
Commit
•
8806fce
1
Parent(s):
a52cb6f
Update README.md
Browse files
README.md
CHANGED
@@ -33,7 +33,7 @@ Our dataset for this fine-tune consists of commits from GitHub, obtained using t
|
|
33 |
|
34 |
Our diff model was trained on a dataset of commits from BigQuery, a large-scale dataset of many programming languages from GitHub repositories. We filtered the dataset by the number of stars in the repository (>100 stars), license (only open-source non-copyleft licensed code included), and length of file (files greater than 2048 tokens in length were excluded).
|
35 |
|
36 |
-
The model was trained using the
|
37 |
|
38 |
## Training Details
|
39 |
|
|
|
33 |
|
34 |
Our diff model was trained on a dataset of commits from BigQuery, a large-scale dataset of many programming languages from GitHub repositories. We filtered the dataset by the number of stars in the repository (>100 stars), license (only open-source non-copyleft licensed code included), and length of file (files greater than 2048 tokens in length were excluded).
|
35 |
|
36 |
+
The model was trained using the Huggingface Codegen tokenizer.
|
37 |
|
38 |
## Training Details
|
39 |
|