End of training

Files changed (4)

README.md +1 -1
adapter_config.json +2 -2
trainer_peft.log +126 -0
training_args.bin +1 -1

README.md CHANGED

@@ -12,7 +12,7 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/noc-lab/PMC_LLAMA2_7B_trainer_lora/runs/n09numov)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/noc-lab/PMC_LLAMA2_7B_trainer_lora/runs/pvbcl0q5)
 # PMC_LLAMA2_7B_trainer_lora
 This model is a fine-tuned version of [chaoyi-wu/PMC_LLAMA_7B](https://huggingface.co/chaoyi-wu/PMC_LLAMA_7B) on an unknown dataset.

adapter_config.json CHANGED

@@ -20,8 +20,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

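The adapter_config.json hunk only swaps the order of the two `target_modules` entries. A minimal sketch of how one might confirm that two adapter configs are equivalent apart from that ordering is shown here. It assumes the order of `target_modules` carries no meaning (as far as I know, PEFT treats the list as a set, so the serialized order can differ between runs). The helper names `normalize` and `same_adapter_config` are hypothetical, and the two dicts contain only the fields visible in the hunk, not the full file.

```python
import json


def normalize(config: dict) -> dict:
    """Return a copy with target_modules sorted so list order is ignored."""
    out = dict(config)
    if isinstance(out.get("target_modules"), list):
        out["target_modules"] = sorted(out["target_modules"])
    return out


def same_adapter_config(a: dict, b: dict) -> bool:
    """Compare two adapter configs, ignoring the order of target_modules."""
    return normalize(a) == normalize(b)


# Only the fields visible in the hunk; the real file has more keys.
old = json.loads(
    '{"target_modules": ["q_proj", "v_proj"], '
    '"task_type": "CAUSAL_LM", "use_dora": false}'
)
new = json.loads(
    '{"target_modules": ["v_proj", "q_proj"], '
    '"task_type": "CAUSAL_LM", "use_dora": false}'
)

print(same_adapter_config(old, new))
```

Under that assumption the change is a serialization artifact rather than a change to which layers receive LoRA adapters.
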
trainer_peft.log CHANGED

@@ -123,3 +123,129 @@
 2024-06-01 14:51 - Start training!!
 2024-06-01 14:51 - Start training!!
 2024-06-01 15:49 - Training complete!!!
+2024-06-01 15:49 - Training complete!!!
+2024-06-01 20:49 - Cuda check
+2024-06-01 20:49 - True
+2024-06-01 20:49 - 2
+2024-06-01 20:49 - Configue Model and tokenizer
+2024-06-01 20:49 - Cuda check
+2024-06-01 20:49 - True
+2024-06-01 20:49 - 2
+2024-06-01 20:49 - Configue Model and tokenizer
+2024-06-01 20:49 - Memory usage in   0.00 GB
+2024-06-01 20:49 - Memory usage in   0.00 GB
+2024-06-01 20:49 - Dataset loaded successfully:
+ train-Jingmei/Pandemic_Wiki
+ test -Jingmei/Pandemic
+2024-06-01 20:49 - Dataset loaded successfully:
+ train-Jingmei/Pandemic_Wiki
+ test -Jingmei/Pandemic
+2024-06-01 20:49 - Tokenize data: DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 2152
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 8264
+    })
+})
+2024-06-01 20:49 - Tokenize data: DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 2152
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 8264
+    })
+})
+2024-06-01 20:49 - Split data into chunks:DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 24863
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 198964
+    })
+})
+2024-06-01 20:49 - Setup PEFT
+2024-06-01 20:49 - Split data into chunks:DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 24863
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 198964
+    })
+})
+2024-06-01 20:49 - Setup PEFT
+2024-06-01 20:49 - Setup optimizer
+2024-06-01 20:49 - Setup optimizer
+2024-06-01 20:49 - Start training!!
+2024-06-01 20:49 - Start training!!
+2024-06-01 20:55 - Cuda check
+2024-06-01 20:55 - True
+2024-06-01 20:55 - 2
+2024-06-01 20:55 - Configue Model and tokenizer
+2024-06-01 20:55 - Cuda check
+2024-06-01 20:55 - True
+2024-06-01 20:55 - 2
+2024-06-01 20:55 - Configue Model and tokenizer
+2024-06-01 20:55 - Memory usage in   0.00 GB
+2024-06-01 20:55 - Memory usage in   0.00 GB
+2024-06-01 20:55 - Dataset loaded successfully:
+ train-Jingmei/Pandemic_Wiki
+ test -Jingmei/Pandemic
+2024-06-01 20:55 - Tokenize data: DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 2152
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 8264
+    })
+})
+2024-06-01 20:55 - Dataset loaded successfully:
+ train-Jingmei/Pandemic_Wiki
+ test -Jingmei/Pandemic
+2024-06-01 20:55 - Split data into chunks:DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 24863
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 198964
+    })
+})
+2024-06-01 20:55 - Setup PEFT
+2024-06-01 20:55 - Tokenize data: DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 2152
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 8264
+    })
+})
+2024-06-01 20:55 - Split data into chunks:DatasetDict({
+    train: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 24863
+    })
+    test: Dataset({
+        features: ['input_ids', 'attention_mask'],
+        num_rows: 198964
+    })
+})
+2024-06-01 20:55 - Setup PEFT
+2024-06-01 20:55 - Setup optimizer
+2024-06-01 20:55 - Setup optimizer
+2024-06-01 20:55 - Continue  training!!
+2024-06-01 20:55 - Continue  training!!
+2024-06-01 20:56 - Training complete!!!
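
In the log above, the "Split data into chunks" step turns 2152 tokenized training rows into 24863 rows (and 8264 test rows into 198964). The training script is not part of this commit, so the following is only a sketch of one common way such a step works: each tokenized row is cut into fixed-size windows. The function name `split_into_chunks` and the `chunk_size` value are assumptions for illustration; the actual script may instead concatenate rows before chunking or use a different size.

```python
def split_into_chunks(example: dict, chunk_size: int) -> list[dict]:
    """Cut one tokenized row into consecutive windows of at most chunk_size tokens."""
    ids = example["input_ids"]
    mask = example["attention_mask"]
    return [
        {
            "input_ids": ids[i:i + chunk_size],
            "attention_mask": mask[i:i + chunk_size],
        }
        for i in range(0, len(ids), chunk_size)
    ]


# Toy row with 10 tokens; chunk_size=4 is a hypothetical value.
toy = {"input_ids": list(range(10)), "attention_mask": [1] * 10}
chunks = split_into_chunks(toy, chunk_size=4)

for chunk in chunks:
    print(chunk["input_ids"])
```

One long row yields several shorter rows, which is why the row count grows after this step while the feature names (`input_ids`, `attention_mask`) stay the same.
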

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:44addaf9c8b314d3a88a7b01508035d34a4e18244af21ea2cde47f3d51ac0894
+oid sha256:f504f95f70f5d19e6e4e48a5e120ce2676ce8241842d7143d46d087125307096
 size 5176
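
training_args.bin is stored through Git LFS, so the diff shows only the pointer file: a version line, the SHA-256 of the real content, and its size in bytes. The size is unchanged (5176) while the oid differs, which means the content changed without changing length. A minimal sketch of how such a pointer is derived from file bytes follows. The function name `lfs_pointer` and the placeholder byte strings are hypothetical; they are not the real training_args.bin contents.

```python
import hashlib


def lfs_pointer(data: bytes) -> str:
    """Build Git LFS pointer text for the given file content."""
    oid = hashlib.sha256(data).hexdigest()
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{oid}\n"
        f"size {len(data)}\n"
    )


# Placeholder contents of equal length, standing in for the two file versions.
old_blob = b"\x00" * 5176
new_blob = b"\x01" * 5176

old_pointer = lfs_pointer(old_blob)
new_pointer = lfs_pointer(new_blob)

print(old_pointer)
print(new_pointer)
```

To check a downloaded file against the pointer in this commit, one could hash the file the same way and compare the result with the `oid sha256:` value on the `+` line.
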