# Amphion Text-to-Audio (TTA) Recipe
## Quick Start
We provide a **[beginner recipe](RECIPE.md)** to demonstrate how to train a cutting edge TTA model. Specifically, it is designed as a latent diffusion model like [AudioLDM](https://arxiv.org/abs/2301.12503), [Make-an-Audio](https://arxiv.org/abs/2301.12661), and [AUDIT](https://arxiv.org/abs/2304.00830).
## Supported Model Architectures
Until now, Amphion has supported a latent diffusion based text-to-audio model: