mssongit commited on
Commit
9a4fd78
1 Parent(s): 6cf7d24

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +44 -21
README.md CHANGED
@@ -1,25 +1,48 @@
1
- KoAlpaca-Polyglot-12.8B (v1.1b)
2
- This model is a fine-tuned version of EleutherAI/polyglot-ko-12.8b on a KoAlpaca Dataset v1.1b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
 
4
- Detail Codes are available at KoAlpaca Github Repository
5
 
6
- Training procedure
7
- Training hyperparameters
 
 
 
 
 
 
 
 
 
8
  The following hyperparameters were used during training:
 
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
- learning_rate: 5e-05
11
- train_batch_size: 1
12
- seed: 42
13
- distributed_type: multi-GPU (A100 80G)
14
- num_devices: 4
15
- gradient_accumulation_steps: 64
16
- total_train_batch_size: 256
17
- total_eval_batch_size: 32
18
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
19
- lr_scheduler_type: linear
20
- num_epochs: 2.0
21
- Framework versions
22
- Transformers 4.28.1
23
- Pytorch 2.0.0+cu117
24
- Datasets 2.11.0
25
- Tokenizers 0.13.3
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - generated_from_trainer
5
+ - polyglot-ko
6
+ - gpt-neox
7
+ - KoAlpaca
8
+ model-index:
9
+ - name: KoAlpaca-Polyglot-12.8B
10
+ results: []
11
+ language:
12
+ - ko
13
+ datasets:
14
+ - KoAlpaca-v1.1b
15
+ pipeline_tag: text-generation
16
+ ---
17
 
 
18
 
19
+ # KoAlpaca-Polyglot-12.8B (v1.1b)
20
+
21
+ This model is a fine-tuned version of [EleutherAI/polyglot-ko-12.8b](https://huggingface.co/EleutherAI/polyglot-ko-12.8b) on a KoAlpaca Dataset v1.1b
22
+
23
+ Detail Codes are available at [KoAlpaca Github Repository](https://github.com/Beomi/KoAlpaca)
24
+
25
+
26
+ ## Training procedure
27
+
28
+ ### Training hyperparameters
29
+
30
  The following hyperparameters were used during training:
31
+ - learning_rate: 5e-05
32
+ - train_batch_size: 1
33
+ - seed: 42
34
+ - distributed_type: multi-GPU (A100 80G)
35
+ - num_devices: 4
36
+ - gradient_accumulation_steps: 64
37
+ - total_train_batch_size: 256
38
+ - total_eval_batch_size: 32
39
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
+ - lr_scheduler_type: linear
41
+ - num_epochs: 2.0
42
+
43
+ ### Framework versions
44
 
45
+ - Transformers 4.28.1
46
+ - Pytorch 2.0.0+cu117
47
+ - Datasets 2.11.0
48
+ - Tokenizers 0.13.3