Gutema/layoutlm-funsd-tf

@@ -1,59 +1,66 @@
----
-library_name: transformers
-license: mit
-base_model: microsoft/layoutlm-base-uncased
-tags:
-- generated_from_keras_callback
-model-index:
-- name: Gutema/layoutlm-funsd-tf
-  results: []
----
-<!-- This model card has been generated automatically according to the information Keras had access to. You should
-probably proofread and complete it, then remove this comment. -->
-# Gutema/layoutlm-funsd-tf
-This model is a fine-tuned version of [microsoft/layoutlm-base-uncased](https://huggingface.co/microsoft/layoutlm-base-uncased) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Train Loss: 1.6885
-- Validation Loss: 1.3544
-- Train Overall Precision: 0.2506
-- Train Overall Recall: 0.2534
-- Train Overall F1: 0.2520
-- Train Overall Accuracy: 0.5600
-- Epoch: 0
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': np.float32(3e-05), 'decay': 0.0, 'beta_1': np.float32(0.9), 'beta_2': np.float32(0.999), 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
-- training_precision: mixed_float16
-### Training results
-| Train Loss | Validation Loss | Train Overall Precision | Train Overall Recall | Train Overall F1 | Train Overall Accuracy | Epoch |
-|:----------:|:---------------:|:-----------------------:|:--------------------:|:----------------:|:----------------------:|:-----:|
-| 1.6885     | 1.3544          | 0.2506                  | 0.2534               | 0.2520           | 0.5600                 | 0     |
-### Framework versions
-- Transformers 4.46.0
-- TensorFlow 2.18.0
-- Datasets 3.0.2
-- Tokenizers 0.20.1

+---
+library_name: transformers
+license: mit
+base_model: microsoft/layoutlm-base-uncased
+tags:
+- generated_from_keras_callback
+model-index:
+- name: layoutlm-funsd-tf
+  results: []
+---
+<!-- This model card has been generated automatically according to the information Keras had access to. You should
+probably proofread and complete it, then remove this comment. -->
+# layoutlm-funsd-tf
+This model is a fine-tuned version of [microsoft/layoutlm-base-uncased](https://huggingface.co/microsoft/layoutlm-base-uncased) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Train Loss: 0.2409
+- Validation Loss: 0.6905
+- Train Overall Precision: 0.7213
+- Train Overall Recall: 0.7858
+- Train Overall F1: 0.7522
+- Train Overall Accuracy: 0.7998
+- Epoch: 7
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': np.float32(3e-05), 'decay': 0.0, 'beta_1': np.float32(0.9), 'beta_2': np.float32(0.999), 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
+- training_precision: mixed_float16
+### Training results
+| Train Loss | Validation Loss | Train Overall Precision | Train Overall Recall | Train Overall F1 | Train Overall Accuracy | Epoch |
+|:----------:|:---------------:|:-----------------------:|:--------------------:|:----------------:|:----------------------:|:-----:|
+| 1.6885     | 1.3544          | 0.2506                  | 0.2534               | 0.2520           | 0.5600                 | 0     |
+| 1.1672     | 0.8949          | 0.5489                  | 0.5996               | 0.5731           | 0.7115                 | 1     |
+| 0.7766     | 0.7345          | 0.6327                  | 0.7476               | 0.6854           | 0.7633                 | 2     |
+| 0.5690     | 0.6435          | 0.6684                  | 0.7727               | 0.7168           | 0.7936                 | 3     |
+| 0.4387     | 0.6493          | 0.7180                  | 0.7792               | 0.7474           | 0.7962                 | 4     |
+| 0.3470     | 0.6380          | 0.7147                  | 0.7893               | 0.7501           | 0.8020                 | 5     |
+| 0.2901     | 0.6569          | 0.7317                  | 0.7868               | 0.7582           | 0.8059                 | 6     |
+| 0.2409     | 0.6905          | 0.7213                  | 0.7858               | 0.7522           | 0.7998                 | 7     |
+### Framework versions
+- Transformers 4.46.0
+- TensorFlow 2.18.0
+- Datasets 3.0.2
+- Tokenizers 0.20.1
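
The training results table in the updated card can be sanity-checked numerically. The following is a minimal sketch, assuming the rows are copied by hand from that table; `rows` and `f1_from` are illustrative names and not part of the repository. It recomputes F1 as the harmonic mean of the reported precision and recall, then finds the epochs with the lowest validation loss and the highest F1.

```python
# Rows copied by hand from the training results table:
# (train_loss, val_loss, precision, recall, f1, accuracy, epoch)
rows = [
    (1.6885, 1.3544, 0.2506, 0.2534, 0.2520, 0.5600, 0),
    (1.1672, 0.8949, 0.5489, 0.5996, 0.5731, 0.7115, 1),
    (0.7766, 0.7345, 0.6327, 0.7476, 0.6854, 0.7633, 2),
    (0.5690, 0.6435, 0.6684, 0.7727, 0.7168, 0.7936, 3),
    (0.4387, 0.6493, 0.7180, 0.7792, 0.7474, 0.7962, 4),
    (0.3470, 0.6380, 0.7147, 0.7893, 0.7501, 0.8020, 5),
    (0.2901, 0.6569, 0.7317, 0.7868, 0.7582, 0.8059, 6),
    (0.2409, 0.6905, 0.7213, 0.7858, 0.7522, 0.7998, 7),
]


def f1_from(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Largest difference between the reported F1 and the recomputed one.
# Small differences are expected because the table values are rounded.
max_f1_gap = max(abs(f1_from(p, r) - f1) for _, _, p, r, f1, _, _ in rows)

# Epoch with the lowest validation loss and epoch with the highest F1.
best_val_epoch = min(rows, key=lambda row: row[1])[6]
best_f1_epoch = max(rows, key=lambda row: row[4])[6]

print(f"max F1 gap: {max_f1_gap:.5f}")
print(f"lowest validation loss at epoch {best_val_epoch}")
print(f"highest F1 at epoch {best_f1_epoch}")
```

On these numbers the validation loss is lowest at epoch 5 and F1 peaks at epoch 6, while the card's headline figures are those of the last epoch (7), with training loss still falling.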

config.json CHANGED

@@ -1,43 +1,43 @@
-{
-  "_name_or_path": "microsoft/layoutlm-base-uncased",
-  "architectures": [
-    "LayoutLMForTokenClassification"
-  ],
-  "attention_probs_dropout_prob": 0.1,
-  "hidden_act": "gelu",
-  "hidden_dropout_prob": 0.1,
-  "hidden_size": 768,
-  "id2label": {
-    "0": "O",
-    "1": "B-HEADER",
-    "2": "I-HEADER",
-    "3": "B-QUESTION",
-    "4": "I-QUESTION",
-    "5": "B-ANSWER",
-    "6": "I-ANSWER"
-  },
-  "initializer_range": 0.02,
-  "intermediate_size": 3072,
-  "label2id": {
-    "B-ANSWER": 5,
-    "B-HEADER": 1,
-    "B-QUESTION": 3,
-    "I-ANSWER": 6,
-    "I-HEADER": 2,
-    "I-QUESTION": 4,
-    "O": 0
-  },
-  "layer_norm_eps": 1e-12,
-  "max_2d_position_embeddings": 1024,
-  "max_position_embeddings": 512,
-  "model_type": "layoutlm",
-  "num_attention_heads": 12,
-  "num_hidden_layers": 12,
-  "output_past": true,
-  "pad_token_id": 0,
-  "position_embedding_type": "absolute",
-  "transformers_version": "4.46.0",
-  "type_vocab_size": 2,
-  "use_cache": true,
-  "vocab_size": 30522
-}

+{
+  "_name_or_path": "microsoft/layoutlm-base-uncased",
+  "architectures": [
+    "LayoutLMForTokenClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "O",
+    "1": "B-HEADER",
+    "2": "I-HEADER",
+    "3": "B-QUESTION",
+    "4": "I-QUESTION",
+    "5": "B-ANSWER",
+    "6": "I-ANSWER"
+  },
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "B-ANSWER": 5,
+    "B-HEADER": 1,
+    "B-QUESTION": 3,
+    "I-ANSWER": 6,
+    "I-HEADER": 2,
+    "I-QUESTION": 4,
+    "O": 0
+  },
+  "layer_norm_eps": 1e-12,
+  "max_2d_position_embeddings": 1024,
+  "max_position_embeddings": 512,
+  "model_type": "layoutlm",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "output_past": true,
+  "pad_token_id": 0,
+  "position_embedding_type": "absolute",
+  "transformers_version": "4.46.0",
+  "type_vocab_size": 2,
+  "use_cache": true,
+  "vocab_size": 30522
+}
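
The label mappings and attention geometry in this config can be checked for internal consistency without loading the model. This is a sketch under stated assumptions: the JSON below is a hand-copied subset of the config shown above, and the variable names (`CONFIG_SUBSET`, `mappings_agree`, `head_dim`, `entity_types`) are illustrative only.

```python
import json

# Hand-copied subset of the config.json shown in the diff above.
CONFIG_SUBSET = """
{
  "hidden_size": 768,
  "num_attention_heads": 12,
  "id2label": {
    "0": "O",
    "1": "B-HEADER",
    "2": "I-HEADER",
    "3": "B-QUESTION",
    "4": "I-QUESTION",
    "5": "B-ANSWER",
    "6": "I-ANSWER"
  },
  "label2id": {
    "B-ANSWER": 5,
    "B-HEADER": 1,
    "B-QUESTION": 3,
    "I-ANSWER": 6,
    "I-HEADER": 2,
    "I-QUESTION": 4,
    "O": 0
  }
}
"""

config = json.loads(CONFIG_SUBSET)

# JSON object keys are always strings, so convert the ids back to integers.
id2label = {int(idx): label for idx, label in config["id2label"].items()}
label2id = config["label2id"]

# The two mappings should be exact inverses of each other.
mappings_agree = all(label2id[label] == idx for idx, label in id2label.items())
mappings_agree = mappings_agree and len(id2label) == len(label2id)

# Each attention head gets an equal slice of the hidden dimension.
head_dim = config["hidden_size"] // config["num_attention_heads"]

# Entity types carried by the B-/I- tags, ignoring the "O" (outside) label.
entity_types = sorted({label.split("-", 1)[1] for label in label2id if label != "O"})

print(mappings_agree, head_dim, entity_types)
```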

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:39d6c39753fb651e8daee38eff915fd6de66d1f10dcae131081cccb607bf1e56
+oid sha256:b9286a81272aa290094f83fd09eff39d61d949915fd0e7915cfd1aac94b4b96f
 size 450829256
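
Only the Git LFS pointer for `tf_model.h5` changes here: the `oid` differs while the size stays at 450829256 bytes. As a rough sketch of how a downloaded weights file could be checked against the new pointer, the code below parses the pointer text and compares a file's size and SHA-256 digest with it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is whatever the reader downloaded.

```python
import hashlib

# New pointer contents, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b9286a81272aa290094f83fd09eff39d61d949915fd0e7915cfd1aac94b4b96f
size 450829256
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest named by the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so a large weights file is never fully held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("tf_model.h5", pointer)` on a local copy would then indicate whether it is the file this commit refers to.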