bthomas commited on
Commit
ea7b94d
1 Parent(s): f83992d

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +73 -0
README.md ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - mlm
5
+ - generated_from_trainer
6
+ model-index:
7
+ - name: article2keyword2.1b_paraphrase-multilingual-MiniLM-L12-v2_finetuned_for_mlm
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ # article2keyword2.1b_paraphrase-multilingual-MiniLM-L12-v2_finetuned_for_mlm
15
+
16
+ This model is a fine-tuned version of [sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2) on an unknown dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.0673
19
+
20
+ ## Model description
21
+
22
+ More information needed
23
+
24
+ ## Intended uses & limitations
25
+
26
+ More information needed
27
+
28
+ ## Training and evaluation data
29
+
30
+ More information needed
31
+
32
+ ## Training procedure
33
+
34
+ ### Training hyperparameters
35
+
36
+ The following hyperparameters were used during training:
37
+ - learning_rate: 2e-05
38
+ - train_batch_size: 4
39
+ - eval_batch_size: 4
40
+ - seed: 42
41
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
+ - lr_scheduler_type: linear
43
+ - num_epochs: 16
44
+ - mixed_precision_training: Native AMP
45
+
46
+ ### Training results
47
+
48
+ | Training Loss | Epoch | Step | Validation Loss |
49
+ |:-------------:|:-----:|:-----:|:---------------:|
50
+ | 2.3777 | 1.0 | 1353 | 0.3168 |
51
+ | 0.2358 | 2.0 | 2706 | 0.1564 |
52
+ | 0.1372 | 3.0 | 4059 | 0.1149 |
53
+ | 0.1046 | 4.0 | 5412 | 0.0956 |
54
+ | 0.086 | 5.0 | 6765 | 0.0853 |
55
+ | 0.0741 | 6.0 | 8118 | 0.0786 |
56
+ | 0.0653 | 7.0 | 9471 | 0.0750 |
57
+ | 0.0594 | 8.0 | 10824 | 0.0726 |
58
+ | 0.0542 | 9.0 | 12177 | 0.0699 |
59
+ | 0.0504 | 10.0 | 13530 | 0.0692 |
60
+ | 0.047 | 11.0 | 14883 | 0.0684 |
61
+ | 0.0444 | 12.0 | 16236 | 0.0675 |
62
+ | 0.0423 | 13.0 | 17589 | 0.0674 |
63
+ | 0.0404 | 14.0 | 18942 | 0.0673 |
64
+ | 0.0392 | 15.0 | 20295 | 0.0672 |
65
+ | 0.0379 | 16.0 | 21648 | 0.0673 |
66
+
67
+
68
+ ### Framework versions
69
+
70
+ - Transformers 4.21.1
71
+ - Pytorch 1.11.0
72
+ - Datasets 2.3.2
73
+ - Tokenizers 0.12.1