Commit e713156 (1 parent: 1f78525)
Committed by jadechoghari

Update README.md

README.md CHANGED
@@ -7,9 +7,7 @@ library_name: transformers
 
 VoiceRestore is a cutting-edge speech restoration model designed to significantly enhance the quality of degraded voice recordings. Leveraging flow-matching transformers, this model excels at addressing a wide range of audio imperfections commonly found in speech, including background noise, reverberation, distortion, and signal loss.
 
-
-
-Credits: This repository is based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)
+It is based on this [repo](https://github.com/skirdey/voicerestore) & demo of audio restorations: [VoiceRestore](https://sparkling-rabanadas-3082be.netlify.app/)
 
 ## Usage
 ``` bash
@@ -33,28 +31,23 @@ model("test_input.wav", "test_output.wav")
 ## Example
 ### Degraded Input:
 
-
-
-Degraded audio (reverberation, distortion, noise, random cut):
+### Degraded Input Audio
 
-
-
-
+<audio controls>
+<source src="https://huggingface.co/jadechoghari/VoiceRestore/resolve/main/test_input.wav" type="audio/mpeg">
+Your browser does not support the audio element.
+</audio>
 
 ---
 ### Restored (steps=32, cfg=1.0):
 
-
+<audio controls>
+<source src="https://huggingface.co/jadechoghari/VoiceRestore/resolve/main/test_output.wav" type="audio/mpeg">
+Your browser does not support the audio element.
+</audio>
 
 Restored audio - 16 steps, strength 0.5:
 
-https://github.com/user-attachments/assets/fdbbb988-9bd2-4750-bddd-32bd5153d254
-
----
-### Ground Truth:
-
-![Ground Truth](./imgs/ground_truth.png "Ground Truth")
-
 ---
 ## Key Features
 
@@ -65,7 +58,6 @@ https://github.com/user-attachments/assets/fdbbb988-9bd2-4750-bddd-32bd5153d254
 ---
 
 
-
 ## Model Details
 
 - **Architecture**: Flow-matching transformer
@@ -99,4 +91,5 @@ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file
 ## Acknowledgments
 
 - Based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)
-- Special thanks to the open-source community for their invaluable contributions.
+- Special thanks to the open-source community for their invaluable contributions.
+- Credits: This repository is based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)