jadechoghari committed on
Commit
e713156
1 Parent(s): 1f78525

Update README.md

Files changed (1)
  1. README.md +12 -19
README.md CHANGED
@@ -7,9 +7,7 @@ library_name: transformers

  VoiceRestore is a cutting-edge speech restoration model designed to significantly enhance the quality of degraded voice recordings. Leveraging flow-matching transformers, this model excels at addressing a wide range of audio imperfections commonly found in speech, including background noise, reverberation, distortion, and signal loss.

- Demo of audio restorations: [VoiceRestore](https://sparkling-rabanadas-3082be.netlify.app/)
-
- Credits: This repository is based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)

  ## Usage
  ``` bash
@@ -33,28 +31,23 @@ model("test_input.wav", "test_output.wav")
  ## Example
  ### Degraded Input:

- ![Degraded Input](./imgs/degraded.png "Degraded Input")
-
- Degraded audio (reverberation, distortion, noise, random cut):

- **Note**: Adjust your volume before playing the degraded audio sample, as it may contain distortions.
-
- https://github.com/user-attachments/assets/0c030274-60b5-41a4-abe6-59a3f1bc934b

  ---
  ### Restored (steps=32, cfg=1.0):

- ![Restored](./imgs/restored.png "Restored")

  Restored audio - 16 steps, strength 0.5:

- https://github.com/user-attachments/assets/fdbbb988-9bd2-4750-bddd-32bd5153d254
-
- ---
- ### Ground Truth:
-
- ![Ground Truth](./imgs/ground_truth.png "Ground Truth")
-
  ---
  ## Key Features

@@ -65,7 +58,6 @@ https://github.com/user-attachments/assets/fdbbb988-9bd2-4750-bddd-32bd5153d254
  ---


-
  ## Model Details

  - **Architecture**: Flow-matching transformer
@@ -99,4 +91,5 @@ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file
  ## Acknowledgments

  - Based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)
- - Special thanks to the open-source community for their invaluable contributions.

  VoiceRestore is a cutting-edge speech restoration model designed to significantly enhance the quality of degraded voice recordings. Leveraging flow-matching transformers, this model excels at addressing a wide range of audio imperfections commonly found in speech, including background noise, reverberation, distortion, and signal loss.

+ It is based on this [repo](https://github.com/skirdey/voicerestore) & demo of audio restorations: [VoiceRestore](https://sparkling-rabanadas-3082be.netlify.app/)

  ## Usage
  ``` bash
 
  ## Example
  ### Degraded Input:

+ ### Degraded Input Audio

+ <audio controls>
+ <source src="https://huggingface.co/jadechoghari/VoiceRestore/resolve/main/test_input.wav" type="audio/mpeg">
+ Your browser does not support the audio element.
+ </audio>

  ---
  ### Restored (steps=32, cfg=1.0):

+ <audio controls>
+ <source src="https://huggingface.co/jadechoghari/VoiceRestore/resolve/main/test_output.wav" type="audio/mpeg">
+ Your browser does not support the audio element.
+ </audio>

  Restored audio - 16 steps, strength 0.5:

  ---
  ## Key Features

 
  ---


  ## Model Details

  - **Architecture**: Flow-matching transformer
 
  ## Acknowledgments

  - Based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)
+ - Special thanks to the open-source community for their invaluable contributions.
+ - Credits: This repository is based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)