patrixtano/mt5-small-finetuned-anaphora_czech

Text2Text Generation

Generated from Trainer


patrixtano committed on Sep 12

Commit 0e5b67a • 1 Parent(s): b1bab76

End of training

Files changed (2)

README.md +9 -9
model.safetensors +1 -1

README.md CHANGED

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5256
-- Score: 43.6341
+- Loss: 0.0560
+- Score: 28.8160
 - Char Order: 6
 - Word Order: 0
 - Beta: 2
@@ -40,8 +40,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -49,11 +49,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Score   | Char Order | Word Order | Beta |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:----------:|:----------:|:----:|
-| 1.2999        | 1.0   | 2105 | 0.7759          | 36.9398 | 6          | 0          | 2    |
-| 0.87          | 2.0   | 4210 | 0.5735          | 41.0183 | 6          | 0          | 2    |
-| 0.7796        | 3.0   | 6315 | 0.5256          | 43.6341 | 6          | 0          | 2    |
+| Training Loss | Epoch | Step  | Validation Loss | Score   | Char Order | Word Order | Beta |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:----------:|:----------:|:----:|
+| 0.1671        | 1.0   | 23181 | 0.0741          | 28.6976 | 6          | 0          | 2    |
+| 0.1169        | 2.0   | 46362 | 0.0598          | 28.7935 | 6          | 0          | 2    |
+| 0.1072        | 3.0   | 69543 | 0.0560          | 28.8160 | 6          | 0          | 2    |
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c43f372203633cca77b599fdcc503eba5c6592663f83f17575d36e6dc9ad48e2
+oid sha256:a686c0a08a1e4aa831b99de81bf96fc95d655b4b97cc3dffc19c6ad834862cf2
 size 1200729512
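
The step counts in the two training-results tables can be cross-checked against the batch sizes in the README diff. The sketch below is a rough sanity check, not part of the commit. It assumes a single device, no gradient accumulation, and that the last partial batch is kept, so that steps per epoch equal ceil(examples / batch size). None of this is stated in the diff, and `example_range` is a hypothetical helper name.

```python
def example_range(steps_per_epoch: int, batch_size: int) -> tuple[int, int]:
    """Return the (min, max) training-set sizes consistent with a step count.

    Assumption (not confirmed by the commit): one optimizer step per batch,
    with steps_per_epoch == ceil(num_examples / batch_size).
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


# Values taken from the tables: the step count at epoch 1.0 and the
# train_batch_size on each side of the diff.
old_run = example_range(steps_per_epoch=2105, batch_size=16)
new_run = example_range(steps_per_epoch=23181, batch_size=2)

print("old run:", old_run)  # (33665, 33680)
print("new run:", new_run)  # (46361, 46362)
```

Under these assumptions the two runs imply different training-set sizes (about 33.7k versus about 46.4k examples), so the lower loss and lower score in the new run may reflect a change in data as well as in batch size. The commit itself does not confirm either reading.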