Training in progress, epoch 0
README.md CHANGED
```diff
@@ -9,11 +9,6 @@ base_model: TheBloke/typhoon-7B-GPTQ
 model-index:
 - name: typhoon-7b-chat-alpaca
   results: []
-datasets:
-- Thaweewat/alpaca-cleaned-52k-th
-language:
-- th
-pipeline_tag: text-generation
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -23,6 +18,18 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [TheBloke/typhoon-7B-GPTQ](https://huggingface.co/TheBloke/typhoon-7B-GPTQ) on the None dataset.
 
+## Model description
+
+More information needed
+
+## Intended uses & limitations
+
+More information needed
+
+## Training and evaluation data
+
+More information needed
+
 ## Training procedure
 
 ### Training hyperparameters
@@ -34,8 +41,13 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- training_steps: 250
 - mixed_precision_training: Native AMP
 
+### Training results
+
+
+
 ### Framework versions
 
 - PEFT 0.7.1
```
adapter_model.safetensors CHANGED
```diff
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:0fe67be2965b74fa7cfb2afd00a2bbed409294c2e62ce34b099975b83b28c79a
 size 27280152
```
runs/Dec25_06-50-05_1c9e386753e4/events.out.tfevents.1703487013.1c9e386753e4.17257.1 ADDED
```diff
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a47758c47d7252804e4b837f0d03427db9a13bcce685a44dfbb99dc646adec85
+size 7191
```
tokenizer.json CHANGED
```diff
@@ -2,7 +2,7 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length":
+    "max_length": 2048,
     "strategy": "LongestFirst",
     "stride": 0
   },
```
training_args.bin CHANGED
```diff
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:32c36b2db91be1a9cfe369f94423bbdca908ccbfb0846507d66e4874e8eea906
 size 4728
```