---
license: cc-by-nc-4.0
pipeline_tag: text-to-speech
library_name: f5-tts
datasets:
- amphion/Emilia-Dataset
---
### 2024/10/14. We changed the license of this checkpoint repo to CC-BY-NC-4.0, following the license of the training set, Emilia, which is an in-the-wild dataset. Sorry for any inconvenience this may cause. Our codebase remains under the MIT license.

Download the [F5-TTS](https://huggingface.co/SWivid/F5-TTS/tree/main/F5TTS_Base) or [E2 TTS](https://huggingface.co/SWivid/E2-TTS/tree/main/E2TTS_Base) checkpoint and place it under `ckpts/`:
```
ckpts/
    E2TTS_Base/
        model_1200000.pt
    F5TTS_Base/
        model_1200000.pt
```
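
As one way to get this layout without downloading by hand, here is a minimal sketch that uses `hf_hub_download` from the `huggingface_hub` package. The helper names `checkpoint_filename` and `fetch_checkpoint` and the `suffix` parameter are hypothetical and not part of the F5-TTS codebase. Only the repo IDs and file names come from the links and tree above.

```python
from pathlib import Path

# Repo IDs taken from the download links above.
REPOS = {
    "F5TTS_Base": "SWivid/F5-TTS",
    "E2TTS_Base": "SWivid/E2-TTS",
}


def checkpoint_filename(model_dir: str, suffix: str = "pt") -> str:
    """Path of the checkpoint inside its repo (hypothetical helper)."""
    return f"{model_dir}/model_1200000.{suffix}"


def fetch_checkpoint(model_dir: str, suffix: str = "pt", root: str = "ckpts") -> Path:
    """Download one checkpoint into root/<model_dir>/ (hypothetical helper)."""
    # Imported here so the rest of the module works without the package.
    from huggingface_hub import hf_hub_download

    local_path = hf_hub_download(
        repo_id=REPOS[model_dir],
        filename=checkpoint_filename(model_dir, suffix),
        local_dir=root,  # keeps the repo's folder structure under root
    )
    return Path(local_path)
```

Calling `fetch_checkpoint("F5TTS_Base")` should leave the file at `ckpts/F5TTS_Base/model_1200000.pt`, assuming network access and that the repo still hosts a file with that name.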
To run inference with the `.safetensors` checkpoints instead, use the same layout:
```
ckpts/
    E2TTS_Base/
        model_1200000.safetensors
    F5TTS_Base/
        model_1200000.safetensors
```
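
Since either file format can sit in the same folder, a small lookup helper may be handy before loading. The sketch below uses only the standard library. The name `find_checkpoint` and the choice to prefer `.safetensors` over `.pt` are assumptions made for illustration, not behavior of the F5-TTS codebase.

```python
from pathlib import Path


def find_checkpoint(model_dir: str, root: str = "ckpts", step: int = 1200000) -> Path:
    """Return the checkpoint path for model_dir (hypothetical helper).

    Assumption: .safetensors is tried first, then .pt.
    """
    base = Path(root) / model_dir
    for suffix in (".safetensors", ".pt"):
        candidate = base / f"model_{step}{suffix}"
        if candidate.is_file():
            return candidate
    raise FileNotFoundError(f"no model_{step} checkpoint found in {base}")
```

The returned path could then be handed to `safetensors.torch.load_file` for a `.safetensors` file or to `torch.load` for a `.pt` file, depending on its suffix.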
GitHub: https://github.com/SWivid/F5-TTS  
Paper: [F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching](https://huggingface.co/papers/2410.06885)