danelcsb committed on Aug 9

Commit

041288b

1 Parent(s): 0062ca7

Model save

Files changed (18)

README.md +88 -0
config.json +82 -0
model.safetensors +3 -0
preprocessor_config.json +27 -0
runs/Aug08_23-59-04_4553f658f331/events.out.tfevents.1723161558.4553f658f331.26688.0 +3 -0
runs/Aug09_00-00-24_4553f658f331/events.out.tfevents.1723161638.4553f658f331.28126.0 +3 -0
runs/Aug09_00-04-29_4553f658f331/events.out.tfevents.1723161885.4553f658f331.29564.0 +3 -0
runs/Aug09_00-06-15_4553f658f331/events.out.tfevents.1723161989.4553f658f331.31003.0 +3 -0
runs/Aug09_00-08-23_4553f658f331/events.out.tfevents.1723162117.4553f658f331.32441.0 +3 -0
runs/Aug09_00-09-54_4553f658f331/events.out.tfevents.1723162210.4553f658f331.35864.0 +3 -0
runs/Aug09_00-11-01_4553f658f331/events.out.tfevents.1723162276.4553f658f331.37956.0 +3 -0
runs/Aug09_04-33-55_a44b5742ff80/events.out.tfevents.1723178053.a44b5742ff80.5133.0 +3 -0
runs/Aug09_04-42-08_a44b5742ff80/events.out.tfevents.1723178546.a44b5742ff80.6692.0 +3 -0
special_tokens_map.json +37 -0
tokenizer.json +0 -0
tokenizer_config.json +56 -0
training_args.bin +3 -0
vocab.txt +0 -0

README.md ADDED

	@@ -0,0 +1,88 @@

+---
+license: apache-2.0
+base_model: IDEA-Research/grounding-dino-tiny
+tags:
+- generated_from_trainer
+model-index:
+- name: grounding-dino-tiny-finetuned-cppe-5-10k-steps
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# grounding-dino-tiny-finetuned-cppe-5-10k-steps
+This model is a fine-tuned version of [IDEA-Research/grounding-dino-tiny](https://huggingface.co/IDEA-Research/grounding-dino-tiny) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 5.5317
+- Map: 0.0151
+- Map 50: 0.0275
+- Map 75: 0.0157
+- Map Small: 0.0125
+- Map Medium: 0.0149
+- Map Large: 0.0236
+- Mar 1: 0.0202
+- Mar 10: 0.0902
+- Mar 100: 0.1127
+- Mar Small: 0.0815
+- Mar Medium: 0.0975
+- Mar Large: 0.1461
+- Map Coverall: 0.0755
+- Mar 100 Coverall: 0.5636
+- Map Face Shield: 0.0
+- Mar 100 Face Shield: 0.0
+- Map Gloves: 0.0
+- Mar 100 Gloves: 0.0
+- Map Goggles: 0.0
+- Mar 100 Goggles: 0.0
+- Map Mask: 0.0
+- Mar 100 Mask: 0.0
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 1
+- eval_batch_size: 1
+- seed: 1337
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 10.0
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Map    | Map 50 | Map 75 | Map Small | Map Medium | Map Large | Mar 1  | Mar 10 | Mar 100 | Mar Small | Mar Medium | Mar Large | Map Coverall | Mar 100 Coverall | Map Face Shield | Mar 100 Face Shield | Map Gloves | Mar 100 Gloves | Map Goggles | Mar 100 Goggles | Map Mask | Mar 100 Mask |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:----------:|:---------:|:------:|:------:|:-------:|:---------:|:----------:|:---------:|:------------:|:----------------:|:---------------:|:-------------------:|:----------:|:--------------:|:-----------:|:---------------:|:--------:|:------------:|
+| 8355.9482     | 1.0   | 850  | 6.6137          | 0.014  | 0.0272 | 0.0134 | 0.0048    | 0.0111     | 0.0243    | 0.0149 | 0.0893 | 0.1073  | 0.0523    | 0.0889     | 0.1328    | 0.0702       | 0.5366           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 6.7523        | 2.0   | 1700 | 6.2357          | 0.0162 | 0.0302 | 0.0148 | 0.0106    | 0.0189     | 0.0192    | 0.0247 | 0.0894 | 0.107   | 0.0643    | 0.0968     | 0.1258    | 0.0809       | 0.535            | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 6.5566        | 3.0   | 2550 | 6.0890          | 0.0158 | 0.0294 | 0.0134 | 0.01      | 0.0199     | 0.0215    | 0.0222 | 0.0876 | 0.1065  | 0.0671    | 0.0846     | 0.1324    | 0.0791       | 0.5323           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 6.2217        | 4.0   | 3400 | 5.9028          | 0.0144 | 0.0271 | 0.0134 | 0.0066    | 0.0096     | 0.0225    | 0.0232 | 0.0857 | 0.107   | 0.06      | 0.0823     | 0.1397    | 0.0721       | 0.5348           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 6.0963        | 5.0   | 4250 | 5.8411          | 0.0126 | 0.0215 | 0.014  | 0.0055    | 0.0138     | 0.0178    | 0.0201 | 0.0811 | 0.1052  | 0.044     | 0.0942     | 0.1377    | 0.0631       | 0.5258           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 5.996         | 6.0   | 5100 | 5.7244          | 0.0162 | 0.0311 | 0.0166 | 0.0059    | 0.0145     | 0.0221    | 0.0223 | 0.0869 | 0.1088  | 0.0667    | 0.0919     | 0.1328    | 0.0812       | 0.5437           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 5.8971        | 7.0   | 5950 | 5.5473          | 0.0154 | 0.027  | 0.016  | 0.01      | 0.014      | 0.0208    | 0.0244 | 0.0946 | 0.1154  | 0.084     | 0.106      | 0.1311    | 0.0769       | 0.5772           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 5.7451        | 8.0   | 6800 | 5.5231          | 0.0146 | 0.0267 | 0.0148 | 0.0021    | 0.0161     | 0.0183    | 0.0256 | 0.0905 | 0.1125  | 0.0325    | 0.1062     | 0.128     | 0.0731       | 0.5624           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 5.7931        | 9.0   | 7650 | 5.5190          | 0.0182 | 0.032  | 0.0195 | 0.0175    | 0.0147     | 0.0249    | 0.0299 | 0.1048 | 0.1138  | 0.08      | 0.0945     | 0.1309    | 0.091        | 0.5688           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+| 5.7435        | 10.0  | 8500 | 5.5317          | 0.0151 | 0.0275 | 0.0157 | 0.0125    | 0.0149     | 0.0236    | 0.0202 | 0.0902 | 0.1127  | 0.0815    | 0.0975     | 0.1461    | 0.0755       | 0.5636           | 0.0             | 0.0                 | 0.0        | 0.0            | 0.0         | 0.0             | 0.0      | 0.0          |
+### Framework versions
+- Transformers 4.45.0.dev0
+- Pytorch 2.2.2
+- Datasets 2.20.0
+- Tokenizers 0.19.1
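
A quick consistency check on the README numbers above, written as a small Python sketch. It assumes (the model card does not say so) that the overall Map and Mar 100 are unweighted means of the five per-class values, and that with train_batch_size 1, a single device and no gradient accumulation, the number of steps per epoch equals the number of training examples. The names `macro_mean`, `map_per_class` and `mar100_per_class` are illustrative only.

```python
# Per-class values copied from the final (epoch 10.0) row of the results table.
map_per_class = {
    "Coverall": 0.0755,
    "Face Shield": 0.0,
    "Gloves": 0.0,
    "Goggles": 0.0,
    "Mask": 0.0,
}
mar100_per_class = {
    "Coverall": 0.5636,
    "Face Shield": 0.0,
    "Gloves": 0.0,
    "Goggles": 0.0,
    "Mask": 0.0,
}


def macro_mean(per_class):
    """Unweighted mean over classes (assumed aggregation, see lead-in)."""
    return sum(per_class.values()) / len(per_class)


overall_map = round(macro_mean(map_per_class), 4)
overall_mar100 = round(macro_mean(mar100_per_class), 4)

# The table logs step 850 at epoch 1.0 and the run lasts 10 epochs.
steps_per_epoch = 850
num_epochs = 10
total_steps = steps_per_epoch * num_epochs

print(overall_map, overall_mar100, total_steps)
```

Under those assumptions the reported Map of 0.0151 and Mar 100 of 0.1127 come entirely from the Coverall class, since the other four classes score 0.0 in every epoch, and the 8500 total steps match the Step column of the last row.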

config.json ADDED

	@@ -0,0 +1,82 @@

+{
+  "_name_or_path": "IDEA-Research/grounding-dino-tiny",
+  "activation_dropout": 0.0,
+  "activation_function": "relu",
+  "architectures": [
+    "GroundingDinoForObjectDetection"
+  ],
+  "attention_dropout": 0.0,
+  "auxiliary_loss": false,
+  "backbone": null,
+  "backbone_config": {
+    "model_type": "swin",
+    "out_features": [
+      "stage2",
+      "stage3",
+      "stage4"
+    ],
+    "out_indices": [
+      2,
+      3,
+      4
+    ]
+  },
+  "backbone_kwargs": null,
+  "bbox_cost": 5.0,
+  "bbox_loss_coefficient": 5.0,
+  "class_cost": 1.0,
+  "class_loss_coefficient": 2.0,
+  "class_loss_reduction": "sum",
+  "d_model": 256,
+  "decoder_attention_heads": 8,
+  "decoder_bbox_embed_share": true,
+  "decoder_ffn_dim": 2048,
+  "decoder_layers": 6,
+  "decoder_n_points": 4,
+  "disable_custom_kernels": false,
+  "dropout": 0.1,
+  "embedding_init_target": true,
+  "encoder_attention_heads": 8,
+  "encoder_ffn_dim": 2048,
+  "encoder_layers": 6,
+  "encoder_n_points": 4,
+  "focal_alpha": 0.25,
+  "fusion_dropout": 0.0,
+  "fusion_droppath": 0.1,
+  "giou_cost": 2.0,
+  "giou_loss_coefficient": 2.0,
+  "id2label": {
+    "0": "Coverall",
+    "1": "Face_Shield",
+    "2": "Gloves",
+    "3": "Goggles",
+    "4": "Mask"
+  },
+  "init_std": 0.02,
+  "is_encoder_decoder": true,
+  "label2id": {
+    "Coverall": 0,
+    "Face_Shield": 1,
+    "Gloves": 2,
+    "Goggles": 3,
+    "Mask": 4
+  },
+  "layer_norm_eps": 1e-05,
+  "max_text_len": 256,
+  "model_type": "grounding-dino",
+  "num_feature_levels": 4,
+  "num_queries": 900,
+  "position_embedding_type": "sine",
+  "positional_embedding_temperature": 20,
+  "query_dim": 4,
+  "text_config": {
+    "model_type": "bert"
+  },
+  "text_enhancer_dropout": 0.0,
+  "torch_dtype": "float32",
+  "transformers_version": "4.45.0.dev0",
+  "two_stage": true,
+  "two_stage_bbox_embed_share": false,
+  "use_pretrained_backbone": false,
+  "use_timm_backbone": false
+}
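
Grounding DINO is text-conditioned, so the class names in `id2label` usually need to be turned into a text prompt at inference time. The sketch below builds one, assuming the convention from the upstream Grounding DINO usage notes that queries are lowercased and each phrase ends with a period. The helper name `build_text_prompt` and the underscore-to-space replacement are illustrative choices, not something this commit specifies.

```python
# Label map copied from the "id2label" entry of config.json above.
id2label = {
    "0": "Coverall",
    "1": "Face_Shield",
    "2": "Gloves",
    "3": "Goggles",
    "4": "Mask",
}


def build_text_prompt(id2label):
    """Join class names into one prompt: lowercase, each phrase followed by ' .'."""
    # Sort by numeric id so the phrase order follows the class indices.
    ordered = [id2label[key] for key in sorted(id2label, key=int)]
    phrases = [name.replace("_", " ").lower() for name in ordered]
    return " ".join(f"{phrase} ." for phrase in phrases)


# label2id in the config is the inverse mapping of id2label.
label2id = {name: int(key) for key, name in id2label.items()}

prompt = build_text_prompt(id2label)
print(prompt)
```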

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:787fe7c9de7d70cb3a0bbe8b1cab074f3668e0061211104b814c83a49edbc33d
+size 689359096
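
The three lines above are a Git LFS pointer, not the weights themselves: a spec version, a SHA-256 object id, and the size in bytes of the real file. As a rough sketch, the pointer can be parsed as below. `parse_lfs_pointer` is a hypothetical helper, and the parameter estimate assumes every byte is a float32 weight (consistent with `"torch_dtype": "float32"` in config.json), so it slightly overstates the count because the safetensors file also carries a header.

```python
# Pointer text copied from the model.safetensors hunk above (without the leading "+").
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:787fe7c9de7d70cb3a0bbe8b1cab074f3668e0061211104b814c83a49edbc33d
size 689359096
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
size_mib = pointer["size"] / 2**20

# Upper bound on the parameter count at 4 bytes per float32 value.
approx_params = pointer["size"] // 4

print(f"{size_mib:.1f} MiB, at most about {approx_params / 1e6:.0f}M float32 values")
```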

preprocessor_config.json ADDED

	@@ -0,0 +1,27 @@

+{
+  "do_convert_annotations": true,
+  "do_normalize": true,
+  "do_pad": true,
+  "do_rescale": true,
+  "do_resize": true,
+  "format": "coco_detection",
+  "image_mean": [
+    0.485,
+    0.456,
+    0.406
+  ],
+  "image_processor_type": "GroundingDinoImageProcessor",
+  "image_std": [
+    0.229,
+    0.224,
+    0.225
+  ],
+  "pad_size": null,
+  "processor_class": "GroundingDinoProcessor",
+  "resample": 2,
+  "rescale_factor": 0.00392156862745098,
+  "size": {
+    "longest_edge": 1333,
+    "shortest_edge": 800
+  }
+}
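
To illustrate what these preprocessing values mean, here is a minimal sketch in plain Python. `rescale_factor` is 1/255, so pixel values are mapped to [0, 1] before the per-channel mean and standard deviation are applied. The `target_size` function is an assumption about how `shortest_edge` and `longest_edge` interact (scale the shorter side to 800 unless that would push the longer side past 1333); the actual image processor may round differently, so treat the exact pixel counts as approximate.

```python
# Values copied from preprocessor_config.json above.
IMAGE_MEAN = (0.485, 0.456, 0.406)
IMAGE_STD = (0.229, 0.224, 0.225)
RESCALE_FACTOR = 0.00392156862745098  # equals 1 / 255


def normalize_pixel(rgb):
    """Rescale one 8-bit RGB pixel to [0, 1], then standardize each channel."""
    return tuple(
        (channel * RESCALE_FACTOR - mean) / std
        for channel, mean, std in zip(rgb, IMAGE_MEAN, IMAGE_STD)
    )


def target_size(height, width, shortest_edge=800, longest_edge=1333):
    """Aspect-preserving resize target (assumed rule, see the note above)."""
    scale = min(shortest_edge / min(height, width), longest_edge / max(height, width))
    return round(height * scale), round(width * scale)


white = normalize_pixel((255, 255, 255))
small = target_size(480, 640)    # limited by the shortest edge
wide = target_size(500, 2000)    # limited by the longest edge
print(white, small, wide)
```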

runs/Aug08_23-59-04_4553f658f331/events.out.tfevents.1723161558.4553f658f331.26688.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:31b381e3ce13c2917d292f572f013f1bb10666510221575adb6fb5e68b33f42e
+size 6527

runs/Aug09_00-00-24_4553f658f331/events.out.tfevents.1723161638.4553f658f331.28126.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:657745056f24983870e85a41908110fe6440c7332488c11ad04d218aaae4b3ab
+size 6527

runs/Aug09_00-04-29_4553f658f331/events.out.tfevents.1723161885.4553f658f331.29564.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:50e204141f07b59e9604b535cbc3176368575864e926b8403fd93769e3fb3b02
+size 6527

runs/Aug09_00-06-15_4553f658f331/events.out.tfevents.1723161989.4553f658f331.31003.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:defdcd94d7c40a160d336a66e9d94125f7a0d278d2252cecf725346130d81bf1
+size 6527

runs/Aug09_00-08-23_4553f658f331/events.out.tfevents.1723162117.4553f658f331.32441.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ec4459f47197437c727f28d50eb4296ddb122fd6da4055172a0ef649e6c0c64b
+size 9807

runs/Aug09_00-09-54_4553f658f331/events.out.tfevents.1723162210.4553f658f331.35864.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bba9dfbf5143dc0bc2fbb5c78f6c9694332394c4d937e798d1ac107c7a95c6e3
+size 7961

runs/Aug09_00-11-01_4553f658f331/events.out.tfevents.1723162276.4553f658f331.37956.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:eebdbae3f46203b8dc71eea34601b4e6bc2dad2d56cea90b438134b889cd58bb
+size 12880

runs/Aug09_04-33-55_a44b5742ff80/events.out.tfevents.1723178053.a44b5742ff80.5133.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:79a8eeb2dd6c9a3e511465feaca41cae7a19445c7f12439c4a1a64e6363cd986
+size 7991

runs/Aug09_04-42-08_a44b5742ff80/events.out.tfevents.1723178546.a44b5742ff80.6692.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f489a2cae806af50c8456ce8eb6393f344e72c6f19c33ab029e0d3f2d275d92e
+size 23385
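
The event file names above appear to follow the usual TensorBoard pattern `events.out.tfevents.<unix time>.<hostname>.<pid>.<index>`. Assuming that layout, and assuming the hostname contains no dots (true for the names in this commit), the sketch below decodes the creation time of a run from its file name. `parse_event_filename` is a hypothetical helper written for this illustration.

```python
from datetime import datetime, timezone


def parse_event_filename(name):
    """Split a tfevents file name into timestamp, hostname and pid (assumed layout)."""
    parts = name.split(".")
    # parts[:3] is ["events", "out", "tfevents"]; the remaining fields follow.
    timestamp, hostname, pid = int(parts[3]), parts[4], int(parts[5])
    return {
        "created_utc": datetime.fromtimestamp(timestamp, tz=timezone.utc),
        "hostname": hostname,
        "pid": pid,
    }


first_run = parse_event_filename(
    "events.out.tfevents.1723161558.4553f658f331.26688.0"
)
last_run = parse_event_filename(
    "events.out.tfevents.1723178546.a44b5742ff80.6692.0"
)

print(first_run["created_utc"].isoformat(), first_run["hostname"])
print(last_run["created_utc"].isoformat(), last_run["hostname"])
```

The decoded UTC times fall within seconds of the times in the run directory names (`Aug08_23-59-04` and `Aug09_04-42-08`), which suggests the training hosts' clocks were set to UTC. The nine runs are split across two hostnames, `4553f658f331` and `a44b5742ff80`.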

special_tokens_map.json ADDED

	@@ -0,0 +1,37 @@

+{
+  "cls_token": {
+    "content": "[CLS]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "mask_token": {
+    "content": "[MASK]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": {
+    "content": "[PAD]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "sep_token": {
+    "content": "[SEP]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "[UNK]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer.json ADDED

The diff for this file is too large to render.

tokenizer_config.json ADDED

	@@ -0,0 +1,56 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": true,
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "processor_class": "GroundingDinoProcessor",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "BertTokenizer",
+  "unk_token": "[UNK]"
+}

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:25039c49df114f4fad99be56baee1461adb2a82290ece3a9f9f27957a49cacfe
+size 5368

vocab.txt ADDED

The diff for this file is too large to render.