Commit aa54b55 by mrfakename · 1 parent: 7112749
Sync from GitHub repo
This Space is synced from the GitHub repo: https://github.com/SWivid/F5-TTS. Please submit contributions to the Space there.
README_REPO.md CHANGED (+10 -1)
@@ -1,16 +1,25 @@
 # F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

+<div style="position: relative; width: 100%;">
+<div style="position: absolute; top: 0; right: 0;">
+<img src="https://avatars.githubusercontent.com/u/35554183?s=200&v=4" alt="Watermark" style="width: 140px; height: auto;">
+</div>
+</div>
+
 [![python](https://img.shields.io/badge/Python-3.10-brightgreen)](https://github.com/SWivid/F5-TTS)
 [![arXiv](https://img.shields.io/badge/arXiv-2410.06885-b31b1b.svg?logo=arXiv)](https://arxiv.org/abs/2410.06885)
 [![demo](https://img.shields.io/badge/GitHub-Demo%20page-blue.svg)](https://swivid.github.io/F5-TTS/)
 [![space](https://img.shields.io/badge/🤗-Space%20demo-yellow)](https://huggingface.co/spaces/mrfakename/E2-F5-TTS)
+[![lab](https://img.shields.io/badge/X--LANCE-Lab-grey?labelColor=lightgrey)](https://x-lance.sjtu.edu.cn/)

 **F5-TTS**: Diffusion Transformer with ConvNeXt V2, faster trained and inference.

-**E2 TTS**: Flat-UNet Transformer, closest reproduction.
+**E2 TTS**: Flat-UNet Transformer, closest reproduction from [paper](https://arxiv.org/abs/2406.18009).

 **Sway Sampling**: Inference-time flow step sampling strategy, greatly improves performance

+### Thanks to all the contributors !
+
 ## Installation

 Clone the repository: