Rongjiehuang committed on
Commit
92898ea
0 Parent(s):
This view is limited to 50 files because it contains too many changes.
Files changed (50)
  1. README.md +43 -0
  2. data/.DS_Store +0 -0
  3. data/en_speech_libritts_50_t2s/.DS_Store +0 -0
  4. data/en_speech_libritts_50_t2s/build/.DS_Store +0 -0
  5. data/en_speech_libritts_50_t2s/build/acoustic/1089_134686_000029_000001.npy +0 -0
  6. data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000005_000000.npy +0 -0
  7. data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000009_000000.npy +0 -0
  8. data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000093_000000.npy +0 -0
  9. data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000113_000000.npy +0 -0
  10. data/en_speech_libritts_50_t2s/build/acoustic/1580_141084_000024_000002.npy +0 -0
  11. data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000005_000003.npy +0 -0
  12. data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000038_000001.npy +0 -0
  13. data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000004_000003.npy +0 -0
  14. data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000027_000006.npy +0 -0
  15. data/en_speech_libritts_50_t2s/build/acoustic/237_126133_000046_000000.npy +0 -0
  16. data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000007_000000.npy +0 -0
  17. data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000017_000010.npy +0 -0
  18. data/en_speech_libritts_50_t2s/build/acoustic/237_134500_000005_000004.npy +0 -0
  19. data/en_speech_libritts_50_t2s/build/acoustic/260_123288_000023_000005.npy +0 -0
  20. data/en_speech_libritts_50_t2s/build/acoustic/260_123440_000024_000002.npy +0 -0
  21. data/en_speech_libritts_50_t2s/build/acoustic/2830_3980_000022_000000.npy +0 -0
  22. data/en_speech_libritts_50_t2s/build/acoustic/3570_5694_000010_000007.npy +0 -0
  23. data/en_speech_libritts_50_t2s/build/acoustic/3570_5696_000011_000007.npy +0 -0
  24. data/en_speech_libritts_50_t2s/build/acoustic/4446_2271_000005_000000.npy +0 -0
  25. data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000010_000002.npy +0 -0
  26. data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000040_000004.npy +0 -0
  27. data/en_speech_libritts_50_t2s/build/acoustic/4446_2275_000002_000004.npy +0 -0
  28. data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000021_000001.npy +0 -0
  29. data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000025_000000.npy +0 -0
  30. data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000035_000008.npy +0 -0
  31. data/en_speech_libritts_50_t2s/build/acoustic/4970_29093_000038_000002.npy +0 -0
  32. data/en_speech_libritts_50_t2s/build/acoustic/4992_41797_000008_000002.npy +0 -0
  33. data/en_speech_libritts_50_t2s/build/acoustic/5142_36377_000008_000001.npy +0 -0
  34. data/en_speech_libritts_50_t2s/build/acoustic/5639_40744_000029_000006.npy +0 -0
  35. data/en_speech_libritts_50_t2s/build/acoustic/5683_32865_000010_000001.npy +0 -0
  36. data/en_speech_libritts_50_t2s/build/acoustic/5683_32879_000048_000003.npy +0 -0
  37. data/en_speech_libritts_50_t2s/build/acoustic/6829_68771_000038_000002.npy +0 -0
  38. data/en_speech_libritts_50_t2s/build/acoustic/6930_81414_000028_000001.npy +0 -0
  39. data/en_speech_libritts_50_t2s/build/acoustic/7021_79740_000002_000000.npy +0 -0
  40. data/en_speech_libritts_50_t2s/build/acoustic/7127_75946_000017_000000.npy +0 -0
  41. data/en_speech_libritts_50_t2s/build/acoustic/7127_75947_000006_000005.npy +0 -0
  42. data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000006_000008.npy +0 -0
  43. data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000069_000002.npy +0 -0
  44. data/en_speech_libritts_50_t2s/build/acoustic/7729_102255_000016_000000.npy +0 -0
  45. data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000011_000008.npy +0 -0
  46. data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000013_000003.npy +0 -0
  47. data/en_speech_libritts_50_t2s/build/acoustic/8455_210777_000002_000002.npy +0 -0
  48. data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000040_000000.npy +0 -0
  49. data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000044_000000.npy +0 -0
  50. data/en_speech_libritts_50_t2s/build/acoustic/8463_294828_000036_000000.npy +0 -0
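The `.npy` file names above appear to follow a LibriTTS-style pattern of four underscore-separated fields. The sketch below splits a name into those fields and counts files per speaker. It assumes the layout `<speaker>_<chapter>_<utterance>_<segment>`; the field names, the `UtteranceId` type, and the `parse_utterance_id` helper are illustrative labels, not something the commit defines.

```python
from collections import Counter
from pathlib import PurePosixPath
from typing import NamedTuple


class UtteranceId(NamedTuple):
    # Field names are assumptions based on LibriTTS-style file naming.
    speaker: str
    chapter: str
    utterance: str
    segment: str


def parse_utterance_id(path: str) -> UtteranceId:
    """Split a file stem such as '1089_134686_000029_000001' into four fields."""
    parts = PurePosixPath(path).stem.split("_")
    if len(parts) != 4:
        raise ValueError(f"unexpected file name: {path!r}")
    return UtteranceId(*parts)


# Three paths copied from the file list above.
paths = [
    "data/en_speech_libritts_50_t2s/build/acoustic/1089_134686_000029_000001.npy",
    "data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000005_000000.npy",
    "data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000009_000000.npy",
]

# Count how many listed files belong to each (assumed) speaker ID.
files_per_speaker = Counter(parse_utterance_id(p).speaker for p in paths)
print(files_per_speaker)
```

The fields are kept as strings so that zero-padded IDs such as `000029` survive a round trip back to a file name.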
README.md ADDED
@@ -0,0 +1,43 @@
+ ---
+ license: other
+ tags:
+ - text-to-speech
+ - LLMs
+ - zero-shot text-to-speech
+ inference: false
+ datasets:
+ - LJSpeech
+ extra_gated_prompt: |-
+   One more step before getting this model.
+   This model is open access and available to all, with a license further specifying rights and usage.
+
+   Any organization or individual is prohibited from using any technology mentioned in this paper to generate someone's speech without his/her consent, including but not limited to government leaders, political figures, and celebrities. If you do not comply with this item, you could be in violation of copyright laws.
+
+
+   By clicking on "Access repository" below, you accept that your *contact information* (email address and username) can be shared with the model authors as well.
+
+ extra_gated_fields:
+   I have read the License and agree with its terms: checkbox
+ ---
+
+ # MVoice Model Card
+
+
+ ## Model Details
+ - **Model type:** Voice LLM for Zero-shot text-to-speech
+ - **Language(s):** English, Mandarin
+ - **Resources for more information:** [MVoice GitHub Repository](https://github.com/Rongjiehuang/MVoice), [MVoice Paper]().
+ - **Cite as:**
+
+ ```bib
+ @article{huang2023make,
+ title={Make-A-Voice: Unified Voice Synthesis With Discrete Representation},
+ author={Huang, Rongjie and Zhang, Chunlei and Wang, Yongqi and Yang, Dongchao and Liu, Luping and Ye, Zhenhui and Jiang, Ziyue and Weng, Chao and Zhao, Zhou and Yu, Dong},
+ journal={arXiv preprint arXiv:2305.19269},
+ year={2023}
+ }
+ ```
+ -
+
+
+ *This model card was written based on the [DALL-E Mini model card](https://huggingface.co/dalle-mini/dalle-mini).*
data/.DS_Store ADDED
Binary file (8.2 kB).
data/en_speech_libritts_50_t2s/.DS_Store ADDED
Binary file (6.15 kB).
data/en_speech_libritts_50_t2s/build/.DS_Store ADDED
Binary file (6.15 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1089_134686_000029_000001.npy ADDED
Binary file (1.4 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000005_000000.npy ADDED
Binary file (4.16 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000009_000000.npy ADDED
Binary file (2.94 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000093_000000.npy ADDED
Binary file (2.91 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000113_000000.npy ADDED
Binary file (4.64 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1580_141084_000024_000002.npy ADDED
Binary file (3.63 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000005_000003.npy ADDED
Binary file (14.1 kB).
data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000038_000001.npy ADDED
Binary file (1.45 kB).
data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000004_000003.npy ADDED
Binary file (22.9 kB).
data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000027_000006.npy ADDED
Binary file (2.12 kB).
data/en_speech_libritts_50_t2s/build/acoustic/237_126133_000046_000000.npy ADDED
Binary file (6.44 kB).
data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000007_000000.npy ADDED
Binary file (7.3 kB).
data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000017_000010.npy ADDED
Binary file (4.88 kB).
data/en_speech_libritts_50_t2s/build/acoustic/237_134500_000005_000004.npy ADDED
Binary file (1.57 kB).
data/en_speech_libritts_50_t2s/build/acoustic/260_123288_000023_000005.npy ADDED
Binary file (7.74 kB).
data/en_speech_libritts_50_t2s/build/acoustic/260_123440_000024_000002.npy ADDED
Binary file (6.32 kB).
data/en_speech_libritts_50_t2s/build/acoustic/2830_3980_000022_000000.npy ADDED
Binary file (5.41 kB).
data/en_speech_libritts_50_t2s/build/acoustic/3570_5694_000010_000007.npy ADDED
Binary file (4.52 kB).
data/en_speech_libritts_50_t2s/build/acoustic/3570_5696_000011_000007.npy ADDED
Binary file (4.86 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4446_2271_000005_000000.npy ADDED
Binary file (1.54 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000010_000002.npy ADDED
Binary file (2 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000040_000004.npy ADDED
Binary file (800 Bytes).
data/en_speech_libritts_50_t2s/build/acoustic/4446_2275_000002_000004.npy ADDED
Binary file (7.16 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000021_000001.npy ADDED
Binary file (11.6 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000025_000000.npy ADDED
Binary file (37.1 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000035_000008.npy ADDED
Binary file (2.22 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4970_29093_000038_000002.npy ADDED
Binary file (1.81 kB).
data/en_speech_libritts_50_t2s/build/acoustic/4992_41797_000008_000002.npy ADDED
Binary file (2.31 kB).
data/en_speech_libritts_50_t2s/build/acoustic/5142_36377_000008_000001.npy ADDED
Binary file (4.74 kB).
data/en_speech_libritts_50_t2s/build/acoustic/5639_40744_000029_000006.npy ADDED
Binary file (13.1 kB).
data/en_speech_libritts_50_t2s/build/acoustic/5683_32865_000010_000001.npy ADDED
Binary file (16.1 kB).
data/en_speech_libritts_50_t2s/build/acoustic/5683_32879_000048_000003.npy ADDED
Binary file (1.54 kB).
data/en_speech_libritts_50_t2s/build/acoustic/6829_68771_000038_000002.npy ADDED
Binary file (1.23 kB).
data/en_speech_libritts_50_t2s/build/acoustic/6930_81414_000028_000001.npy ADDED
Binary file (2.46 kB).
data/en_speech_libritts_50_t2s/build/acoustic/7021_79740_000002_000000.npy ADDED
Binary file (2.58 kB).
data/en_speech_libritts_50_t2s/build/acoustic/7127_75946_000017_000000.npy ADDED
Binary file (1.26 kB).
data/en_speech_libritts_50_t2s/build/acoustic/7127_75947_000006_000005.npy ADDED
Binary file (18.8 kB).
data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000006_000008.npy ADDED
Binary file (8.41 kB).
data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000069_000002.npy ADDED
Binary file (10.8 kB).
data/en_speech_libritts_50_t2s/build/acoustic/7729_102255_000016_000000.npy ADDED
Binary file (4.33 kB).
data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000011_000008.npy ADDED
Binary file (9.13 kB).
data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000013_000003.npy ADDED
Binary file (2.31 kB).
data/en_speech_libritts_50_t2s/build/acoustic/8455_210777_000002_000002.npy ADDED
Binary file (5.98 kB).
data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000040_000000.npy ADDED
Binary file (4.98 kB).
data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000044_000000.npy ADDED
Binary file (5.31 kB).
data/en_speech_libritts_50_t2s/build/acoustic/8463_294828_000036_000000.npy ADDED
Binary file (2.38 kB).