End of training

- README.md +14 -23
- adapter_model.safetensors +1 -1
- training_args.bin +1 -1
README.md CHANGED

@@ -15,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # finetune_colpali_v1_2-german-4bit
 
-This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the
+This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the German_docx dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Model Preparation Time: 0.
+- Loss: 0.1100
+- Model Preparation Time: 0.008
 
 ## Model description
 
@@ -46,31 +46,22 @@ The following hyperparameters were used during training:
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
-- num_epochs:
+- num_epochs: 5
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time |
 |:-------------:|:------:|:----:|:---------------:|:----------------------:|
-| No log | 0.0533 | 1 | 0.
-| 1.
-| 1.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.2019 | 5.3333 | 100 | 0.0390 | 0.0095 |
-| 0.1092 | 5.8667 | 110 | 0.0337 | 0.0095 |
-| 0.0884 | 6.4 | 120 | 0.0344 | 0.0095 |
-| 0.2341 | 6.9333 | 130 | 0.0433 | 0.0095 |
-| 0.1872 | 7.4667 | 140 | 0.0448 | 0.0095 |
-| 0.1533 | 8.0 | 150 | 0.0485 | 0.0095 |
-| 0.1681 | 8.5333 | 160 | 0.0525 | 0.0095 |
-| 0.2414 | 9.0667 | 170 | 0.0590 | 0.0095 |
-| 0.1814 | 9.6 | 180 | 0.0594 | 0.0095 |
+| No log | 0.0533 | 1 | 0.3717 | 0.008 |
+| 1.1358 | 0.5333 | 10 | 0.3356 | 0.008 |
+| 1.2182 | 1.0667 | 20 | 0.2811 | 0.008 |
+| 0.844 | 1.6 | 30 | 0.2365 | 0.008 |
+| 0.7722 | 2.1333 | 40 | 0.1990 | 0.008 |
+| 0.4823 | 2.6667 | 50 | 0.1758 | 0.008 |
+| 0.46 | 3.2 | 60 | 0.1451 | 0.008 |
+| 0.1477 | 3.7333 | 70 | 0.1252 | 0.008 |
+| 0.1764 | 4.2667 | 80 | 0.1258 | 0.008 |
+| 0.2329 | 4.8 | 90 | 0.1100 | 0.008 |
 
 
 ### Framework versions
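For reference, the hyperparameters listed in the card map onto `transformers.TrainingArguments` roughly as sketched below. This is a minimal illustration, not the script that produced this run; anything not shown in the diff (learning rate, batch sizes, output path, precise eval cadence) is a placeholder.

```python
from transformers import TrainingArguments

# Sketch of the arguments named in the model card above. Values not present in
# the diff (output_dir, learning_rate, batch sizes) are placeholders only.
args = TrainingArguments(
    output_dir="finetune_colpali_v1_2-german-4bit",  # placeholder path
    optim="adamw_torch",         # AdamW; betas=(0.9, 0.999) and eps=1e-08 are the defaults
    lr_scheduler_type="linear",
    warmup_steps=100,
    num_train_epochs=5,
    eval_strategy="steps",       # the results table evaluates every 10 steps
    eval_steps=10,               # (`evaluation_strategy` on older transformers releases)
    logging_steps=10,
)
```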
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:7a7c8c840d5514721e64e756d0e0c35897eda4348d6d8fa2d0a258bf1cb86e47
 size 157071680
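The file above holds the fine-tuned adapter weights rather than a full model. A minimal loading sketch with the colpali-engine package follows, assuming the repo ships an adapter_config.json pointing at the base checkpoint (as colpali-engine training runs normally produce); the repo id is hypothetical, and the 4-bit quantization is only included to mirror the "4bit" suffix in the model name.

```python
import torch
from colpali_engine.models import ColPali, ColPaliProcessor
from transformers import BitsAndBytesConfig

# Hypothetical repo id for this adapter; substitute the actual one.
adapter_repo = "your-username/finetune_colpali_v1_2-german-4bit"

# Optional 4-bit quantization, matching the "4bit" suffix of the model name.
bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16)

model = ColPali.from_pretrained(
    adapter_repo,                    # base model is resolved from adapter_config.json
    quantization_config=bnb_config,
    device_map="auto",
).eval()

# Processor files live with the base checkpoint if the adapter repo does not include them.
processor = ColPaliProcessor.from_pretrained("vidore/colpaligemma-3b-pt-448-base")
```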
training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:07741319549368dc90fb30caeb2bf231216e51d35276e55123323b45017e06d4
 size 5240
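Both binary files are stored as Git LFS pointers: the `oid sha256:` line is the checksum of the actual object, not the object itself. A quick way to check a downloaded file against its pointer, using only the Python standard library:

```python
import hashlib

def sha256_of(path: str) -> str:
    """Stream a file and return its hex SHA-256, as recorded in the LFS pointer."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):  # read in 1 MiB chunks
            digest.update(chunk)
    return digest.hexdigest()

# Should print 7a7c8c840d5514721e64e756d0e0c35897eda4348d6d8fa2d0a258bf1cb86e47 for the
# adapter weights committed here (and the 0774... digest for training_args.bin).
print(sha256_of("adapter_model.safetensors"))
```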