wr commited on
Commit
c581410
1 Parent(s): 93d74ac

add samples and pretrained checkpoint

Browse files
README.md CHANGED
@@ -26,7 +26,12 @@ This manifest is an attempt to recreate the Text-to-Speech recipe used for train
26
  ### Tools
27
 
28
  - [manifest/utils](./manifest/utils/) is used to downsample waveform, extract speaker embedding, generate manifest, and apply vocoder.
29
- - [pretrained_vocoder](./pretrained_vocoder/) provides the pre-trained vocoder.
 
 
 
 
 
30
 
31
  ### Reference
32
 
 
26
  ### Tools
27
 
28
  - [manifest/utils](./manifest/utils/) is used to downsample waveform, extract speaker embedding, generate manifest, and apply vocoder.
29
+ - [pretrained_vocoder](./pretrained_vocoder/) provides the pre-trained vocoder.
30
+
31
+ ### Model and Samples
32
+
33
+ - [speecht5_tts.pt](./speecht5_tts.pt) are reimplemented Voice Conversion fine-tuning on the released manifest **but with a smaller batch size or max updates** (Ensure the manifest is ok).
34
+ - [samples](./samples/) are created by the released fine-tuned model and vocoder.
35
 
36
  ### Reference
37
 
samples/.DS_Store ADDED
Binary file (6.15 kB). View file
 
samples/121_121726_000020_000001_gen.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:39ad3be797066e764f94b9966f231142ae8b3ee0c608b714699d162d762eb227
3
+ size 32812
samples/237_134493_000021_000002_gen.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee0fb2e1b9035b980c4d15ea51de77c7a82b73a495aed9ae2c4d2dfa76338a9b
3
+ size 193580
samples/260_123286_000038_000001_gen.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:050b3400319535e4ff9a5d493e363bf070d81eacb414a094406fd9109fcf030d
3
+ size 182316
samples/gen_wav200.tar.gz ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4ff3458af0c41c4f466ebffc29cca112bcfd7639408c4ba3e86a07eb0c428cdd
3
+ size 36323514
speecht5_tts.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2a1fb04815fe33e7b6f765270e99ea6353cc98758aba82a195c18dfe0ffbf7ee
3
+ size 616005677