---
license: apache-2.0
language:
- en
---

<p align="center">
🔗 <a href="https://rhymes.ai/" target="_blank">Try Aria!</a> · 📖 <a href="https://www.rhymes.ai/blog-details/aria-first-open-multimodal-native-moe-model" target="_blank">Blog</a> · 📌 <a href="https://arxiv.org/pdf/2410.05993" target="_blank">Paper</a>
· ⭐ <a href="https://github.com/rhymes-ai/Aria" target="_blank">GitHub</a>
</p>

# Gallery
<img src="https://huggingface.co/rhymes-ai/Allegro/resolve/main/gallery.gif" width="1000" height="800"/>

For more demos and corresponding prompts, see the [Allegro Gallery](TBD).

# Key Features

Allegro generates high-quality, 6-second videos at 15 frames per second and 720p resolution from simple text prompts.

- xxx
- xxx
- xxx

# Model info

<table>
<tr>
<th>Model</th>
<td>Allegro</td>
</tr>
<tr>
<th>Description</th>
<td>Text-to-Video Diffusion Transformer</td>
</tr>
<tr>
<th>Download</th>
<td>HF link: TBD</td>
</tr>
<tr>
<th rowspan="2">Parameters</th>
<td>VAE: 175M</td>
</tr>
<tr>
<td>DiT: 2.8B</td>
</tr>
<tr>
<th rowspan="2">Inference Precision</th>
<td>VAE: FP32/TF32/BF16/FP16 (best in FP32/TF32)</td>
</tr>
<tr>
<td>DiT/T5: BF16/FP32/TF32</td>
</tr>
<tr>
<th>Context Length</th>
<td>79.2k</td>
</tr>
<tr>
<th>Resolution</th>
<td>720 x 1280</td>
</tr>
<tr>
<th>Frames</th>
<td>88</td>
</tr>
<tr>
<th>Video Length</th>
<td>6 seconds @ 15 fps</td>
</tr>
<tr>
<th>Single GPU Memory Usage</th>
<td>9.3 GB in BF16 (with cpu_offload)</td>
</tr>
</table>
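
The figures in the table can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and the compression factors (4x temporal and 8x spatial VAE downsampling, followed by a 2x2 spatial patch) are assumptions that are not stated in this card. Under those assumptions the token count matches the listed context length. The weight-memory helper counts parameters only, so it excludes the T5 text encoder, activations, and any offloading effects, and it is not meant to reproduce the single-GPU memory figure.

```python
def video_duration_seconds(num_frames: int, fps: float) -> float:
    """Playback length of a clip with num_frames frames at fps frames per second."""
    return num_frames / fps


def token_count(num_frames: int, height: int, width: int,
                temporal_stride: int = 4,   # assumed VAE temporal compression
                spatial_stride: int = 8,    # assumed VAE spatial compression
                patch_size: int = 2) -> int:  # assumed DiT spatial patch size
    """Number of latent patches (tokens) the DiT would attend over."""
    latent_frames = num_frames // temporal_stride
    rows = height // (spatial_stride * patch_size)
    cols = width // (spatial_stride * patch_size)
    return latent_frames * rows * cols


def weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Memory for the weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    # 88 frames at 15 fps is just under 6 seconds.
    print(f"duration: {video_duration_seconds(88, 15):.2f} s")
    # Under the assumed strides this gives 79200 tokens, i.e. 79.2k.
    print(f"tokens: {token_count(88, 720, 1280)}")
    # DiT weights in BF16 (2 bytes each) and VAE weights in FP32 (4 bytes each).
    print(f"DiT BF16 weights: {weight_memory_gb(2.8e9, 2):.1f} GB")
    print(f"VAE FP32 weights: {weight_memory_gb(175e6, 4):.1f} GB")
```

The duration check also explains the "6 seconds" entry: 88 frames at 15 fps is slightly less than 6 seconds, so the listed length is a rounded value.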

# Quick start

# License

This repo is released under the Apache 2.0 License.

# Disclaimer

The Allegro models are provided on an "AS IS" basis, and we disclaim any liability for consequences or damages arising from your use. Users are kindly advised to ensure compliance with all applicable laws and regulations. This includes, but is not limited to, prohibitions against illegal activities and the generation of content that is violent, pornographic, obscene, or otherwise unsafe, inappropriate, or illegal. By using these models, you agree that we shall not be held accountable for any consequences resulting from your use.