Model save

Files changed (5) hide show

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/dongfu/Mantis/runs/canpdqed)
 # mantis-8b-idefics2-video-eval-50k_4096
 This model is a fine-tuned version of [HuggingFaceM4/idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b) on an unknown dataset.
@@ -33,7 +33,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
-- num_epochs: 2.0
 ### Training results

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/dongfu/Mantis/runs/0ssto7ph)
 # mantis-8b-idefics2-video-eval-50k_4096
 This model is a fine-tuned version of [HuggingFaceM4/idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b) on an unknown dataset.
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-06
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
+- num_epochs: 1.0
 ### Training results

model-00001-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:230d362270462a21636821044b53a2747e5cc06b672a109f1624d0cd012a6fdf
 size 4966706832

 version https://git-lfs.github.com/spec/v1
+oid sha256:155c09cc3348a1627ac8623d962f545cd22600c9d75905738b815257f0aed8c2
 size 4966706832

model-00002-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:42919a5d75a03e32e613e8333960dade0f71f5d4f01d935b974cbfb3477652c0
 size 4915917232

 version https://git-lfs.github.com/spec/v1
+oid sha256:610a4d2ab9bd94b8ebfcc1ebfa741ca178b0bc3227ca77731e1a79acd71fe3bf
 size 4915917232

model-00003-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ca0bacb4d8f00c9e38d0d60127f7b56a642b3bb66073ca897ef4943276bccb59
 size 4999820504

 version https://git-lfs.github.com/spec/v1
+oid sha256:ce8c2fd31662d85be030d0eb02d09a6b322dc051487564978f3f9fa05f9436a7
 size 4999820504

model-00004-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:20c240e67bfb92c45620b4ad0ec5ff9fbd4528efc527712224f912617ae2bbe5
 size 1923190976

 version https://git-lfs.github.com/spec/v1
+oid sha256:b50bc4f69217d00b81d4aa3345c1ba5f70083f07f935062dd5829a3c58a44347
 size 1923190976