gonzalobenegas commited on
Commit
9849233
1 Parent(s): 4fe2b29

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -0
README.md ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - biology
5
+ - genomics
6
+ - dna
7
+ ---
8
+
9
+ # Tokenizer for causal language modeling of DNA sequences
10
+
11
+ ```json
12
+ "vocab": {
13
+ "[PAD]": 0,
14
+ "[UNK]": 1,
15
+ "a": 2,
16
+ "c": 3,
17
+ "g": 4,
18
+ "t": 5,
19
+ },
20
+ ```