Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -144,7 +144,7 @@ We adapted the original Falcon-7B model to Spanish and Catalan by swapping the t
 The training corpus consists 26B tokens of several corpora gathered from web crawlings and public corpora.
-| Dataset             | Language | Tokens (pre-epoch) | Epochs       |
 |---------------------|----------|--------------------|--------------|
 | Wikipedia           | en       |           2169.97M |  1.428144485 |
 | C4_es               | es       |          53709.80M | 0.1049686196 |

 The training corpus consists 26B tokens of several corpora gathered from web crawlings and public corpora.
+| Dataset             | Language | Tokens (per-epoch) | Epochs       |
 |---------------------|----------|--------------------|--------------|
 | Wikipedia           | en       |           2169.97M |  1.428144485 |
 | C4_es               | es       |          53709.80M | 0.1049686196 |