--- license: apache-2.0 language: - en library_name: diffusers ---
Gallery · GitHub · Blog · Paper · Discord
# Gallery For more demos and corresponding prompts, see the [Allegro Gallery](https://rhymes.ai/allegro_gallery). # Key Feature - **Open Source**: Full [model weights](https://huggingface.co/rhymes-ai/Allegro) and [code](https://github.com/rhymes-ai/Allegro) available to the community, Apache 2.0! - **Versatile Content Creation**: Capable of generating a wide range of content, from close-ups of humans and animals to diverse dynamic scenes. - **High-Quality Output**: Generate detailed 6-second videos at 15 FPS with 720x1280 resolution, which can be interpolated to 30 FPS with [EMA-VFI](https://github.com/MCG-NJU/EMA-VFI). - **Small and Efficient**: Features a 175M parameter VideoVAE and a 2.8B parameter VideoDiT model. Supports multiple precisions (FP32, BF16, FP16) and uses 9.3 GB of GPU memory in BF16 mode with CPU offloading. Context length is 79.2K, equivalent to 88 frames. # Model infoModel | Allegro |
---|---|
Description | Text-to-Video Generation Model |
Download | Hugging Face |
Parameter | VAE: 175M |
DiT: 2.8B | |
Inference Precision | VAE: FP32/TF32/BF16/FP16 (best in FP32/TF32) |
DiT/T5: BF16/FP32/TF32 | |
Context Length | 79.2K |
Resolution | 720 x 1280 |
Frames | 88 |
Video Length | 6 seconds @ 15 fps |
Single GPU Memory Usage | 9.3G BF16 (with cpu_offload) |