Rongjiehuang
commited on
Commit
•
92898ea
0
Parent(s):
init
Browse filesThis view is limited to 50 files because it contains too many changes.
See raw diff
- README.md +43 -0
- data/.DS_Store +0 -0
- data/en_speech_libritts_50_t2s/.DS_Store +0 -0
- data/en_speech_libritts_50_t2s/build/.DS_Store +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1089_134686_000029_000001.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000005_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000009_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000093_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000113_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1580_141084_000024_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000005_000003.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000038_000001.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000004_000003.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000027_000006.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/237_126133_000046_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000007_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000017_000010.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/237_134500_000005_000004.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/260_123288_000023_000005.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/260_123440_000024_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/2830_3980_000022_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/3570_5694_000010_000007.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/3570_5696_000011_000007.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4446_2271_000005_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000010_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000040_000004.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4446_2275_000002_000004.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000021_000001.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000025_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000035_000008.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4970_29093_000038_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/4992_41797_000008_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/5142_36377_000008_000001.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/5639_40744_000029_000006.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/5683_32865_000010_000001.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/5683_32879_000048_000003.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/6829_68771_000038_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/6930_81414_000028_000001.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/7021_79740_000002_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/7127_75946_000017_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/7127_75947_000006_000005.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000006_000008.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000069_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/7729_102255_000016_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000011_000008.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000013_000003.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/8455_210777_000002_000002.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000040_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000044_000000.npy +0 -0
- data/en_speech_libritts_50_t2s/build/acoustic/8463_294828_000036_000000.npy +0 -0
README.md
ADDED
@@ -0,0 +1,43 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: other
|
3 |
+
tags:
|
4 |
+
- text-to-speech
|
5 |
+
- LLMs
|
6 |
+
- zero-shot text-to-speech
|
7 |
+
inference: false
|
8 |
+
datasets:
|
9 |
+
- LJSpeech
|
10 |
+
extra_gated_prompt: |-
|
11 |
+
One more step before getting this model.
|
12 |
+
This model is open access and available to all, with a license further specifying rights and usage.
|
13 |
+
|
14 |
+
Any organization or individual is prohibited from using any technology mentioned in this paper to generate someone's speech without his/her consent, including but not limited to government leaders, political figures, and celebrities. If you do not comply with this item, you could be in violation of copyright laws.
|
15 |
+
|
16 |
+
|
17 |
+
By clicking on "Access repository" below, you accept that your *contact information* (email address and username) can be shared with the model authors as well.
|
18 |
+
|
19 |
+
extra_gated_fields:
|
20 |
+
I have read the License and agree with its terms: checkbox
|
21 |
+
---
|
22 |
+
|
23 |
+
# MVoice Model Card
|
24 |
+
|
25 |
+
|
26 |
+
## Model Details
|
27 |
+
- **Model type:** Voice LLM for Zero-shot text-to-speech
|
28 |
+
- **Language(s):** English, Mandarin
|
29 |
+
- **Resources for more information:** [MVoice GitHub Repository](https://github.com/Rongjiehuang/MVoice), [MVoice Paper]().
|
30 |
+
- **Cite as:**
|
31 |
+
|
32 |
+
```bib
|
33 |
+
@article{huang2023make,
|
34 |
+
title={Make-A-Voice: Unified Voice Synthesis With Discrete Representation},
|
35 |
+
author={Huang, Rongjie and Zhang, Chunlei and Wang, Yongqi and Yang, Dongchao and Liu, Luping and Ye, Zhenhui and Jiang, Ziyue and Weng, Chao and Zhao, Zhou and Yu, Dong},
|
36 |
+
journal={arXiv preprint arXiv:2305.19269},
|
37 |
+
year={2023}
|
38 |
+
}
|
39 |
+
```
|
40 |
+
-
|
41 |
+
|
42 |
+
|
43 |
+
*This model card was written based on the [DALL-E Mini model card](https://huggingface.co/dalle-mini/dalle-mini).*
|
data/.DS_Store
ADDED
Binary file (8.2 kB). View file
|
|
data/en_speech_libritts_50_t2s/.DS_Store
ADDED
Binary file (6.15 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/.DS_Store
ADDED
Binary file (6.15 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1089_134686_000029_000001.npy
ADDED
Binary file (1.4 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000005_000000.npy
ADDED
Binary file (4.16 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1284_1181_000009_000000.npy
ADDED
Binary file (2.94 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000093_000000.npy
ADDED
Binary file (2.91 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1580_141083_000113_000000.npy
ADDED
Binary file (4.64 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1580_141084_000024_000002.npy
ADDED
Binary file (3.63 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000005_000003.npy
ADDED
Binary file (14.1 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/1995_1826_000038_000001.npy
ADDED
Binary file (1.45 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000004_000003.npy
ADDED
Binary file (22.9 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/2300_131720_000027_000006.npy
ADDED
Binary file (2.12 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/237_126133_000046_000000.npy
ADDED
Binary file (6.44 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000007_000000.npy
ADDED
Binary file (7.3 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/237_134493_000017_000010.npy
ADDED
Binary file (4.88 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/237_134500_000005_000004.npy
ADDED
Binary file (1.57 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/260_123288_000023_000005.npy
ADDED
Binary file (7.74 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/260_123440_000024_000002.npy
ADDED
Binary file (6.32 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/2830_3980_000022_000000.npy
ADDED
Binary file (5.41 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/3570_5694_000010_000007.npy
ADDED
Binary file (4.52 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/3570_5696_000011_000007.npy
ADDED
Binary file (4.86 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4446_2271_000005_000000.npy
ADDED
Binary file (1.54 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000010_000002.npy
ADDED
Binary file (2 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4446_2273_000040_000004.npy
ADDED
Binary file (800 Bytes). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4446_2275_000002_000004.npy
ADDED
Binary file (7.16 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000021_000001.npy
ADDED
Binary file (11.6 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000025_000000.npy
ADDED
Binary file (37.1 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4507_16021_000035_000008.npy
ADDED
Binary file (2.22 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4970_29093_000038_000002.npy
ADDED
Binary file (1.81 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/4992_41797_000008_000002.npy
ADDED
Binary file (2.31 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/5142_36377_000008_000001.npy
ADDED
Binary file (4.74 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/5639_40744_000029_000006.npy
ADDED
Binary file (13.1 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/5683_32865_000010_000001.npy
ADDED
Binary file (16.1 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/5683_32879_000048_000003.npy
ADDED
Binary file (1.54 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/6829_68771_000038_000002.npy
ADDED
Binary file (1.23 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/6930_81414_000028_000001.npy
ADDED
Binary file (2.46 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/7021_79740_000002_000000.npy
ADDED
Binary file (2.58 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/7127_75946_000017_000000.npy
ADDED
Binary file (1.26 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/7127_75947_000006_000005.npy
ADDED
Binary file (18.8 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000006_000008.npy
ADDED
Binary file (8.41 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/7176_92135_000069_000002.npy
ADDED
Binary file (10.8 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/7729_102255_000016_000000.npy
ADDED
Binary file (4.33 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000011_000008.npy
ADDED
Binary file (9.13 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/8230_279154_000013_000003.npy
ADDED
Binary file (2.31 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/8455_210777_000002_000002.npy
ADDED
Binary file (5.98 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000040_000000.npy
ADDED
Binary file (4.98 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/8463_294825_000044_000000.npy
ADDED
Binary file (5.31 kB). View file
|
|
data/en_speech_libritts_50_t2s/build/acoustic/8463_294828_000036_000000.npy
ADDED
Binary file (2.38 kB). View file
|
|