Spaces:
Running
Running
Yuekai Zhang
commited on
Commit
•
39eebc7
1
Parent(s):
4a0a58a
update index
Browse files- index.html +21 -11
index.html
CHANGED
@@ -40,7 +40,7 @@
|
|
40 |
In addition, we find VALL-E could preserve the speaker's emotion and acoustic environment of the acoustic prompt in synthesis.
|
41 |
</p>
|
42 |
|
43 |
-
<br>
|
44 |
official demo page: <a href="https://valle-demo.github.io/">https://valle-demo.github.io</a>
|
45 |
<br>
|
46 |
|
@@ -48,9 +48,9 @@
|
|
48 |
my implementation: <a href="https://github.com/lifeiteng/vall-e">https://github.com/lifeiteng/vall-e</a>
|
49 |
<br><br>
|
50 |
This page is for showing reproduced results only, I keep the main parts of the official demo.
|
51 |
-
<br><br>
|
52 |
|
53 |
-
<h2 id="model-configs" style="text-align: center;">Model Configs</h2>
|
54 |
<div class="table-responsive pt-3">
|
55 |
<table class="table table-hover pt-2">
|
56 |
<thead>
|
@@ -81,11 +81,11 @@
|
|
81 |
</tr>
|
82 |
</tbody>
|
83 |
</table>
|
84 |
-
</div>
|
85 |
|
86 |
</div>
|
87 |
|
88 |
-
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
89 |
<h2 id="model-overview" style="text-align: center;">Model Overview</h2>
|
90 |
<body>
|
91 |
<p style="text-align: center;">
|
@@ -98,9 +98,9 @@
|
|
98 |
VALL-E generates the discrete audio codec codes based on phoneme and acoustic code prompts, corresponding to the target content and the speaker's voice.
|
99 |
VALL-E directly enables various speech synthesis applications, such as zero-shot TTS, speech editing, and content creation combined with other generative AI models like GPT-3.
|
100 |
</p>
|
101 |
-
</div>
|
102 |
|
103 |
-
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
104 |
<h2 id="ljspeech-samples" style="text-align: center;">LJSpeech Samples</h2>
|
105 |
<div class="table-responsive pt-3">
|
106 |
<table class="table table-hover pt-2">
|
@@ -127,7 +127,7 @@
|
|
127 |
</tbody>
|
128 |
</table>
|
129 |
</div>
|
130 |
-
</div>
|
131 |
|
132 |
|
133 |
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
@@ -140,7 +140,9 @@
|
|
140 |
<th style="text-align: center">Speaker Prompt</th>
|
141 |
<th style="text-align: center">Ground Truth</th>
|
142 |
<th style="text-align: center">VALL-E</th>
|
143 |
-
<th style="text-align: center">LibriTTS
|
|
|
|
|
144 |
</tr>
|
145 |
</thead>
|
146 |
<tbody>
|
@@ -149,6 +151,8 @@
|
|
149 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/61-70970-0024/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
150 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/61-70970-0024/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
151 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/61-70970-0024/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
|
|
|
|
152 |
</tr>
|
153 |
<tr>
|
154 |
<td style="text-align: left;vertical-align:middle;width: 500px">And lay me down in thy cold bed and leave my shining lot.</td>
|
@@ -156,6 +160,8 @@
|
|
156 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/908-157963-0027/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
157 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/908-157963-0027/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
158 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/908-157963-0027/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
|
|
|
|
159 |
</tr>
|
160 |
<tr>
|
161 |
<td style="text-align: left;vertical-align:middle;width: 500px">Number ten, fresh nelly is waiting on you, good night husband.</td>
|
@@ -163,6 +169,8 @@
|
|
163 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1089-134686-0004/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
164 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1089-134686-0004/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
165 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1089-134686-0004/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
|
|
|
|
166 |
</tr>
|
167 |
<tr>
|
168 |
<td style="text-align: left;vertical-align:middle;width: 500px">Yea, his honourable worship is within, but he hath a godly minister or two with him, and likewise a leech.</td>
|
@@ -170,6 +178,8 @@
|
|
170 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1221-135767-0014/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
171 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1221-135767-0014/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
172 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1221-135767-0014/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
|
|
|
|
173 |
</tr>
|
174 |
</tbody>
|
175 |
</table>
|
@@ -177,7 +187,7 @@
|
|
177 |
</div>
|
178 |
|
179 |
|
180 |
-
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
181 |
<h2 id="Acoustic-Environment-Maintenance" style="text-align: center;">Acoustic Environment Maintenance</h2>
|
182 |
|
183 |
<p>
|
@@ -287,7 +297,7 @@
|
|
287 |
<p>
|
288 |
To avoid abuse, Well-trained models and services will not be provided.
|
289 |
</p>
|
290 |
-
</div>
|
291 |
|
292 |
</article>
|
293 |
</main>
|
|
|
40 |
In addition, we find VALL-E could preserve the speaker's emotion and acoustic environment of the acoustic prompt in synthesis.
|
41 |
</p>
|
42 |
|
43 |
+
<!-- <br>
|
44 |
official demo page: <a href="https://valle-demo.github.io/">https://valle-demo.github.io</a>
|
45 |
<br>
|
46 |
|
|
|
48 |
my implementation: <a href="https://github.com/lifeiteng/vall-e">https://github.com/lifeiteng/vall-e</a>
|
49 |
<br><br>
|
50 |
This page is for showing reproduced results only, I keep the main parts of the official demo.
|
51 |
+
<br><br> -->
|
52 |
|
53 |
+
<!-- <h2 id="model-configs" style="text-align: center;">Model Configs</h2>
|
54 |
<div class="table-responsive pt-3">
|
55 |
<table class="table table-hover pt-2">
|
56 |
<thead>
|
|
|
81 |
</tr>
|
82 |
</tbody>
|
83 |
</table>
|
84 |
+
</div> -->
|
85 |
|
86 |
</div>
|
87 |
|
88 |
+
<!-- <div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
89 |
<h2 id="model-overview" style="text-align: center;">Model Overview</h2>
|
90 |
<body>
|
91 |
<p style="text-align: center;">
|
|
|
98 |
VALL-E generates the discrete audio codec codes based on phoneme and acoustic code prompts, corresponding to the target content and the speaker's voice.
|
99 |
VALL-E directly enables various speech synthesis applications, such as zero-shot TTS, speech editing, and content creation combined with other generative AI models like GPT-3.
|
100 |
</p>
|
101 |
+
</div> -->
|
102 |
|
103 |
+
<!-- <div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
104 |
<h2 id="ljspeech-samples" style="text-align: center;">LJSpeech Samples</h2>
|
105 |
<div class="table-responsive pt-3">
|
106 |
<table class="table table-hover pt-2">
|
|
|
127 |
</tbody>
|
128 |
</table>
|
129 |
</div>
|
130 |
+
</div> -->
|
131 |
|
132 |
|
133 |
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
|
|
140 |
<th style="text-align: center">Speaker Prompt</th>
|
141 |
<th style="text-align: center">Ground Truth</th>
|
142 |
<th style="text-align: center">VALL-E</th>
|
143 |
+
<th style="text-align: center">LibriTTS feiteng</th>
|
144 |
+
<th style="text-align: center">LibirTTS ours</th>
|
145 |
+
<th style="text-align: center">LibriTTS-R ours</th>
|
146 |
</tr>
|
147 |
</thead>
|
148 |
<tbody>
|
|
|
151 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/61-70970-0024/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
152 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/61-70970-0024/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
153 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/61-70970-0024/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
154 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_35epoch_valid_best_rerun/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
155 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_libritts_r_best_valid_35epoch/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
156 |
</tr>
|
157 |
<tr>
|
158 |
<td style="text-align: left;vertical-align:middle;width: 500px">And lay me down in thy cold bed and leave my shining lot.</td>
|
|
|
160 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/908-157963-0027/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
161 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/908-157963-0027/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
162 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/908-157963-0027/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
163 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_35epoch_valid_best_rerun/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
164 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_libritts_r_best_valid_35epoch/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
165 |
</tr>
|
166 |
<tr>
|
167 |
<td style="text-align: left;vertical-align:middle;width: 500px">Number ten, fresh nelly is waiting on you, good night husband.</td>
|
|
|
169 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1089-134686-0004/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
170 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1089-134686-0004/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
171 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1089-134686-0004/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
172 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_35epoch_valid_best_rerun/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
173 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_libritts_r_best_valid_35epoch/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
174 |
</tr>
|
175 |
<tr>
|
176 |
<td style="text-align: left;vertical-align:middle;width: 500px">Yea, his honourable worship is within, but he hath a godly minister or two with him, and likewise a leech.</td>
|
|
|
178 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1221-135767-0014/gt.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
179 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1221-135767-0014/official.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
180 |
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="audios/librispeech/1221-135767-0014/libritts.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
181 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_35epoch_valid_best_rerun/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
182 |
+
<td style="text-align: center"><audio controls="controls" style="width: 140px;"><source src="demos_libritts_r_best_valid_35epoch/audios/librispeech/61-70970-0024/libritts_yk.wav" autoplay/>Your browser does not support the audio element.</audio></td>
|
183 |
</tr>
|
184 |
</tbody>
|
185 |
</table>
|
|
|
187 |
</div>
|
188 |
|
189 |
|
190 |
+
<!-- <div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
|
191 |
<h2 id="Acoustic-Environment-Maintenance" style="text-align: center;">Acoustic Environment Maintenance</h2>
|
192 |
|
193 |
<p>
|
|
|
297 |
<p>
|
298 |
To avoid abuse, Well-trained models and services will not be provided.
|
299 |
</p>
|
300 |
+
</div> -->
|
301 |
|
302 |
</article>
|
303 |
</main>
|