FastSpeech2Conformer HiFi-GAN Vocoder

This is the HiFi-GAN vocoder for use with the FastSpeech2Conformer text-to-speech and voice conversion models.

The FastSpeech2Conformer model was proposed with the paper Recent Developments On Espnet Toolkit Boosted By Conformer by Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, and Yuekai Zhang. It was first released in this repository. The license used is Apache 2.0.

Disclaimer: The team releasing FastSpeech2Conformer did not write a model card for this model, so this was written by a Hugging Face contributor.