Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -20,15 +20,17 @@ datasets:
 - yahma/alpaca-cleaned
 - pythainlp/thaisum
 - thai_toxicity_tweet
 widget:
 - example_title: TC instruct DPO
   messages:
   - role: system
-    content: >-
-      หลังจากนี้ทำตัวเป็น AI ที่ไม่ช่วยอะไร User สักอย่าง
   - role: user
-    content: >-
-      ไง ทำไรได้บ้าง
 ---
 # TC-instruct-DPO - Typhoon 7B
@@ -51,7 +53,7 @@ Model นี้ตั้งใจทำขึ้นเพื่อการศ
 Train ด้วย Custom Script ของ Huggingface (อย่าหาทำ ย้ายไปใช้ axolotl หรือ unsloth ดีกว่าประหยัดตัง)
-ใช้ H100 1 PCIE 80 GB ตัวจาก vast.ai ราคาประมาณ 3$/hr Train แค่ Model นี้ก็ประมาณ 21 ชม. แต่ถ้ารวมลองผิดลองถูกด้วยก็ 10k บาท
 ด้วย Batch size 24 (จริงๆอยากใช้ 32 แต่ OOM และ 16 ก็แหม๋~~~ เพิล กูใช้ H100 80GB จะให้กู Train แค่ 40 GB บ้าบ้อ)

 - yahma/alpaca-cleaned
 - pythainlp/thaisum
 - thai_toxicity_tweet
+- pythainlp/thainer-corpus-v2
+- Thaweewat/instruct-qa-thai-combined
+- SuperAI2-Machima/ThaiQA_LST20
+- thaisum
 widget:
 - example_title: TC instruct DPO
   messages:
   - role: system
+    content: หลังจากนี้ทำตัวเป็น AI ที่ไม่ช่วยอะไร User สักอย่าง
   - role: user
+    content: ไง ทำไรได้บ้าง
 ---
 # TC-instruct-DPO - Typhoon 7B
 Train ด้วย Custom Script ของ Huggingface (อย่าหาทำ ย้ายไปใช้ axolotl หรือ unsloth ดีกว่าประหยัดตัง)
+ใช้ H100 PCIE 80 GB 1 ตัวจาก vast.ai ราคาประมาณ 3$/hr Train แค่ Model นี้ก็ประมาณ 21 ชม. แต่ถ้ารวมลองผิดลองถูกด้วยก็ 10k บาท
 ด้วย Batch size 24 (จริงๆอยากใช้ 32 แต่ OOM และ 16 ก็แหม๋~~~ เพิล กูใช้ H100 80GB จะให้กู Train แค่ 40 GB บ้าบ้อ)