palicoqiqi
/

paligemma_vqav2_1

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

palicoqiqi commited on 17 days ago

Commit

64e202a

•

1 Parent(s): 7767162

palicoqiqi/paligemma_VQAv2_enel645_1

Files changed (2) hide show

README.md +14 -14
adapter_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4158
 ## Model description
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -50,18 +50,18 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch   | Step | Validation Loss |
 |:-------------:|:-------:|:----:|:---------------:|
-| No log        | 0.9976  | 318  | 0.3705          |
-| 1.1065        | 1.9984  | 637  | 0.2753          |
-| 1.1065        | 2.9992  | 956  | 0.2679          |
-| 0.2268        | 4.0     | 1275 | 0.2718          |
-| 0.1558        | 4.9976  | 1593 | 0.2638          |
-| 0.1558        | 5.9984  | 1912 | 0.2820          |
-| 0.1057        | 6.9992  | 2231 | 0.3018          |
-| 0.0623        | 8.0     | 2550 | 0.3286          |
-| 0.0623        | 8.9976  | 2868 | 0.3621          |
-| 0.0325        | 9.9984  | 3187 | 0.3919          |
-| 0.0191        | 10.9992 | 3506 | 0.4125          |
-| 0.0191        | 11.9718 | 3816 | 0.4158          |
 ### Framework versions

 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6032
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch   | Step | Validation Loss |
 |:-------------:|:-------:|:----:|:---------------:|
+| No log        | 0.9976  | 318  | 0.2614          |
+| 0.511         | 1.9984  | 637  | 0.2837          |
+| 0.511         | 2.9992  | 956  | 0.3610          |
+| 0.1393        | 4.0     | 1275 | 0.3947          |
+| 0.0468        | 4.9976  | 1593 | 0.5137          |
+| 0.0468        | 5.9984  | 1912 | 0.6421          |
+| 0.0202        | 6.9992  | 2231 | 0.5855          |
+| 0.0126        | 8.0     | 2550 | 0.5457          |
+| 0.0126        | 8.9976  | 2868 | 0.5446          |
+| 0.0083        | 9.9984  | 3187 | 0.5436          |
+| 0.0062        | 10.9992 | 3506 | 0.5754          |
+| 0.0062        | 11.9718 | 3816 | 0.6032          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9dd72d788319fa64cd5be1f240fdc55857e220ff0c6c347ada45776ca6551cbe
 size 45258384

 version https://git-lfs.github.com/spec/v1
+oid sha256:cfabbe70419c9191bb24899ac1dcaf9ba73f903be9683a32fcfe5642e548e77d
 size 45258384