Canwen Xu
commited on
Commit
•
05d7855
1
Parent(s):
4da8245
Update README.md
Browse files
README.md
CHANGED
@@ -45,7 +45,7 @@ Based on the hyper-parameter searching on the learning rate and batch size, we s
|
|
45 |
|
46 |
## Eval results
|
47 |
|
48 |
-
| |
|
49 |
|------------|-------------------:|--------------------:|-------------------:|-------------------:|------------------:|
|
50 |
| CPM-Small | 109M | 12 | 768 | 12 | 64 |
|
51 |
| CPM-Medium | 334M | 24 | 1,024 | 16 | 64 |
|
|
|
45 |
|
46 |
## Eval results
|
47 |
|
48 |
+
| | n_param | n_layers | d_model | n_heads | d_head |
|
49 |
|------------|-------------------:|--------------------:|-------------------:|-------------------:|------------------:|
|
50 |
| CPM-Small | 109M | 12 | 768 | 12 | 64 |
|
51 |
| CPM-Medium | 334M | 24 | 1,024 | 16 | 64 |
|