OpenNLPLab
commited on
Commit
•
9cb6819
1
Parent(s):
a227b70
Update README.md
Browse files
README.md
CHANGED
@@ -45,8 +45,8 @@ This official repo introduces the TransNormerLLM model, featuring its open-sourc
|
|
45 |
- **TransNormerLLM1** incorporates Simple GLU in its channel mixer, GLA in the token mixer, and SRMSNorm for normalization. In this model, the channel and token mixers function sequentially in a pipeline arrangement.
|
46 |
- **TransNormerLLM2** also utilizes Simple GLU in the channel mixer, GLA in the token mixer, and SRMSNorm for normalization. However, in this version, the channel and token mixers operate concurrently, in parallel.
|
47 |
|
48 |
-
<p align="
|
49 |
-
<img src="./images/TransNormer1-2Arch.jpg" width="
|
50 |
</p>
|
51 |
|
52 |
# Released Weights
|
@@ -191,6 +191,6 @@ If you wish to cite our work, please use the following reference:
|
|
191 |
```
|
192 |
|
193 |
<p align="center">
|
194 |
-
<img src="./images/lightning-leopard.jpg" width="
|
195 |
- OpenNLPLab @2024 -
|
196 |
</p>
|
|
|
45 |
- **TransNormerLLM1** incorporates Simple GLU in its channel mixer, GLA in the token mixer, and SRMSNorm for normalization. In this model, the channel and token mixers function sequentially in a pipeline arrangement.
|
46 |
- **TransNormerLLM2** also utilizes Simple GLU in the channel mixer, GLA in the token mixer, and SRMSNorm for normalization. However, in this version, the channel and token mixers operate concurrently, in parallel.
|
47 |
|
48 |
+
<p align="center">
|
49 |
+
<img src="./images/TransNormer1-2Arch.jpg" width="85%" />
|
50 |
</p>
|
51 |
|
52 |
# Released Weights
|
|
|
191 |
```
|
192 |
|
193 |
<p align="center">
|
194 |
+
<img src="./images/lightning-leopard.jpg" width="67%" />
|
195 |
- OpenNLPLab @2024 -
|
196 |
</p>
|