Update README.md
README.md
CHANGED
@@ -121,7 +121,7 @@ Please refer to [togethercomputer/RedPajama-Data-1T](https://huggingface.co/data
 
 - **Hardware:** 512 nodes of 6xV100 (IBM Power9), on the OLCF Summit cluster
 - **Optimizer:** Apex FusedAdam
-- **Parallelism:** Pipeline parallel 12,
+- **Parallelism:** Pipeline parallel 12, tensor parallel 2
 - **Gradient Accumulations**: 8 (global batch size 4M tokens)
 - **Num of Tokens:** 800B Tokens
 - **Learning rate:** 0.00012
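The corrected parallelism line makes it possible to sanity-check how the listed figures fit together. The sketch below is only an illustration: it assumes all 512 x 6 GPUs take part in a single job, that whatever is not used for pipeline or tensor parallelism is data parallelism, and that "4M tokens" means 4 x 2^20. None of these assumptions is stated in the README, and the function name is hypothetical.

```python
def data_parallel_degree(nodes: int, gpus_per_node: int,
                         pipeline_parallel: int, tensor_parallel: int) -> int:
    """Return the implied number of data-parallel replicas.

    Assumes (not stated in the README) that every GPU belongs to exactly
    one model replica of size pipeline_parallel * tensor_parallel.
    """
    total_gpus = nodes * gpus_per_node
    gpus_per_replica = pipeline_parallel * tensor_parallel
    if total_gpus % gpus_per_replica != 0:
        raise ValueError("GPU count is not divisible by the model-parallel size")
    return total_gpus // gpus_per_replica


# Figures taken from the README lines in the diff.
dp = data_parallel_degree(nodes=512, gpus_per_node=6,
                          pipeline_parallel=12, tensor_parallel=2)

# Assumption: "4M tokens" is read as 4 * 2**20 tokens per global batch.
global_batch_tokens = 4 * 2**20
grad_accum_steps = 8

# Tokens processed by one replica in one accumulation step.
tokens_per_micro_step = global_batch_tokens // (dp * grad_accum_steps)

print(dp, tokens_per_micro_step)
```

Under these assumptions, each model replica spans 24 GPUs, and the global batch divides evenly across replicas and accumulation steps.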