fishaudio
/

fish-speech-1.2-sft

Model card Files Files and versions Community

Edit model card

Fish Speech V1.2

Fish Speech V1.2 is a leading text-to-speech (TTS) model trained on 300k hours of English, Chinese, and Japanese audio data.

Please refer to Fish Speech Github for more info.
Demo available at Fish Audio.

Citation

If you found this repository useful, please consider citing this work:

@misc{fish-speech-v1,
  author = {Shijia Liao, Tianyu Li},
  title = {Fish Speech V1},
  year = {2024},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/fishaudio/fish-speech}}
}

License

This model is permissively licensed under the BY-CC-NC-SA-4.0 license. The source code is released under BY-CC-NC-SA-4.0 license.

Downloads last month: 1,652

Inference Examples

Inference API (serverless) has been turned off for this model.

Space using fishaudio/fish-speech-1.2-sft 1