redadmiral commited on
Commit
9dbc755
1 Parent(s): b7971b7

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ This Model is a fine-tuned version of T-systems [summarization model v1](https://huggingface.co/deutsche-telekom/mt5-small-sum-de-en-v1).
2
+
3
+ We used 1000 examples of headline-content pairs from BR24 articles for the fine-tuning process.
4
+
5
+ Despite the small amount of training data, the tonality of the summarizations has changed significantly. Many of the resulting summaries do sound like a headline.
6
+
7
+ ## Training
8
+
9
+ We used the following parameters for training this model:
10
+
11
+ + base model: deutsche-telekom/mt5-small-sum-de-en-v1
12
+ + source_prefix: "summarize: "
13
+ + batch size: 4
14
+ + max_source_length: 400
15
+ + max_target_length: 35
16
+ + weight_decay: 0.01
17
+ + number of train epochs: 1
18
+ + learning rate: 5e-5
19
+
20
+ ## License
21
+
22
+ Since the base model is trained on the MLSUM dataset, this model may not be used for commercial use.
23
+
24
+ ## Stats
25
+
26
+ | Model | Rouge1 | Rouge2 | RougeL | RougeLSum |
27
+ |------------------------------|-----------|----------|-----------|-----------|
28
+ | headlines_test_small_example | 13.573500 | 3.694700 | 12.560600 | 12.60000 |