Update README.md
README.md
CHANGED
@@ -54,7 +54,7 @@ Finetuned by Mr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company sp
54 |   CPT(Continue-Pretraining)->SFT->DPO training model based on gemma-2-27b-it through 8 H100-80Gs as a Korean language model <br>
55 |   It is a model that has been trained to handle Korean-Chinese-English-Japanese cross-training data and 10M korean news corpus and logic judgment data for various tasks to enable cross-fertilization processing and complex Korean logic & math problems. <br>
56 |   -Tokenizer uses the base model without word expansion<br>
57 | - -Models enhanced with high-dimensional analysis of customer reviews and social posts, as well as coding, writing,
57 | + -Models enhanced with high-dimensional analysis of customer reviews and social posts, as well as coding, writing, math and decision making<br>
58 |   -128k-Context Window<br>
59 |   -Deepspeed Stage=3, use rslora and BAdam Layer Mode<br>
60 |   <br><br>
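For readers unfamiliar with the training settings named on README line 59, the following is a minimal sketch of what "Deepspeed Stage=3" and "rslora" usually refer to. It is not the author's training setup: the rank and alpha values, the `lora_scaling` helper, and the `bf16` entry are assumptions made for illustration, and BAdam is not covered. Only the ZeRO stage number comes from the README.

```python
import json
import math

# Hypothetical values: the README does not state the LoRA rank or alpha.
LORA_RANK = 64
LORA_ALPHA = 16


def lora_scaling(alpha: float, rank: int, use_rslora: bool) -> float:
    """Return the multiplier applied to the low-rank update B @ A.

    Standard LoRA scales the update by alpha / rank.
    Rank-stabilized LoRA (rsLoRA) scales it by alpha / sqrt(rank),
    so the update does not shrink as quickly when the rank grows.
    """
    if use_rslora:
        return alpha / math.sqrt(rank)
    return alpha / rank


# Minimal DeepSpeed config fragment selecting ZeRO stage 3, which
# partitions optimizer states, gradients, and parameters across GPUs.
# Only "stage": 3 comes from the README; the bf16 entry is an assumption.
ds_config = {
    "zero_optimization": {"stage": 3},
    "bf16": {"enabled": True},
}

if __name__ == "__main__":
    print(json.dumps(ds_config, indent=2))
    print("standard LoRA scaling:", lora_scaling(LORA_ALPHA, LORA_RANK, use_rslora=False))
    print("rsLoRA scaling:", lora_scaling(LORA_ALPHA, LORA_RANK, use_rslora=True))
```

With these illustrative numbers the standard factor is 16 / 64 and the rsLoRA factor is 16 / 8, which shows why rsLoRA is often chosen when training with larger ranks.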