sberbank-ai
commited on
Commit
•
060b7d9
1
Parent(s):
2d19463
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,38 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
tags:
|
4 |
+
- Kandinsky
|
5 |
+
- text-image
|
6 |
+
- text2image
|
7 |
+
- diffusion
|
8 |
+
- latent diffusion
|
9 |
+
- mCLIP-XLMR
|
10 |
+
- mT5
|
11 |
---
|
12 |
+
|
13 |
+
# Kandinsky 2.0
|
14 |
+
Kandinsky 2.0 - the first multilingual latent diffusion text2image model.
|
15 |
+
*UNet size: 1.2B parameters*
|
16 |
+
|
17 |
+
<img src="https://github.com/ai-forever/NATALLE/blob/main/content/NatallE.png" width=100% border="2"/>
|
18 |
+
|
19 |
+
It is a latent diffusion model with two multi-lingual text encoders:
|
20 |
+
* mCLIP-XLMR (344M parameters)
|
21 |
+
* mT5-small (300M parameters)
|
22 |
+
|
23 |
+
|
24 |
+
These encoders and multilingual training datasets unveil the real multilingual text2image generation experience!
|
25 |
+
|
26 |
+
|
27 |
+
# Authors
|
28 |
+
|
29 |
+
+ Arseniy Shakhmatov: [Github](https://github.com/cene555), [Blog](https://t.me/gradientdip)
|
30 |
+
+ Anton Razzhigaev: [Github](https://github.com/razzant), [Blog](https://t.me/abstractDL)
|
31 |
+
+ Aleksandr Nikolich: [Github](https://github.com/AlexWortega), [Blog](https://t.me/lovedeathtransformers)
|
32 |
+
+ Vladimir Arkhipkin: [Github](https://github.com/oriBetelgeuse)
|
33 |
+
+ Igor Pavlov: [Github](https://github.com/boomb0om)
|
34 |
+
+ Andrey Kuznetsov: [Github](https://github.com/kuznetsoffandrey)
|
35 |
+
+ Denis Dimitrov: [Github](https://github.com/denndimitrov)
|
36 |
+
|
37 |
+
|
38 |
+
|