luel committed on
Commit
a2b9b21
1 Parent(s): c9735b6

Create README.md

  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
+ ---
+ language: ti
+ tags:
+ - tigrinya
+ - gpt2
+ - text-generation
+ ---
+
+ # GPT-2 for Tigrinya Language
+
+ This repository contains a GPT-2 model trained from scratch on Tigrinya text data. The model was trained using the Hugging Face Transformers library.
+
+ ## Model Details
+
+ - Model Type: GPT-2
+ - Language: Tigrinya
+ - Vocabulary Size: 16000
+ - Maximum Length: 128
+ - Model Size: Small
+ - Number of Parameters: 33,523,200
+
+ ## Training Details
+
+ - Number of Epochs: 12
+ - Batch Size: 1 (with gradient accumulation steps of 4)
+ - Learning Rate: 5e-4
+
+ ## Dataset Statistics
+ - Total number of words: 16061839
+ - Total number of unique words: 458901
+
+ ## Usage
+
+ ```python
+ from transformers import pipeline
+
+ # Load the model
+ generator = pipeline('text-generation', model='luel/gpt2-tigrinya-small')
+
+ # Generate text
+ text = generator("ትግራይ", max_length=60)
+ print(text)
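
Note on the usage snippet in this diff: `print(text)` prints the raw pipeline result. For a single string prompt, the `text-generation` pipeline returns a list of dictionaries, each with a `generated_text` key, so readers may want only the strings. The following is a minimal sketch of that extraction, not part of the committed README. The helper names `extract_generated_texts` and `generate` are hypothetical, and the model id is copied from the README as-is.

```python
from typing import Dict, List


def extract_generated_texts(outputs: List[Dict[str, str]]) -> List[str]:
    """Return only the generated strings from text-generation pipeline output."""
    return [item["generated_text"] for item in outputs]


def generate(prompt: str, max_length: int = 60) -> List[str]:
    """Generate continuations for `prompt` (hypothetical wrapper, for illustration)."""
    # Imported lazily so extract_generated_texts works without transformers installed.
    from transformers import pipeline

    # Model id taken from the README; loading it downloads weights from the Hub.
    generator = pipeline("text-generation", model="luel/gpt2-tigrinya-small")
    return extract_generated_texts(generator(prompt, max_length=max_length))
```

Calling `generate("ትግራይ")` and printing each returned string would then show the generated text without the surrounding list and dictionary syntax.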