keithhon's picture
Upload samples/README.md with huggingface_hub
c37a507
|
raw
history blame
943 Bytes

The audio files in this folder are provided for toolbox testing and benchmarking purposes. These are the same reference utterances used by the SV2TTS authors to generate the audio samples located at: https://google.github.io/tacotron/publications/speaker_adaptation/index.html

The p240_00000.mp3 and p260_00000.mp3 files are compressed versions of audios from the VCTK corpus available at: https://datashare.is.ed.ac.uk/handle/10283/3443 VCTK.txt contains the copyright notices and licensing information.

The 1320_00000.mp3, 3575_00000.mp3, 6829_00000.mp3 and 8230_00000.mp3 files are compressed versions of audios from the LibriSpeech dataset available at: https://openslr.org/12 For these files, the following notice applies:

LibriSpeech (c) 2014 by Vassil Panayotov

LibriSpeech ASR corpus is licensed under a
Creative Commons Attribution 4.0 International License.

See <http://creativecommons.org/licenses/by/4.0/>.