readme: |
  # Fish Diffusion - HiFiSinger Demo 🎤

  GitHub Repo: [fishaudio/fish-diffusion](https://github.com/fishaudio/fish-diffusion)

  To share a new model, please check out the [Share Your Model](https://huggingface.co/spaces/fishaudio/fish-diffusion/discussions/2) discussion.

max_mixing_speakers: 3

models:
  - name: "M4Singer Pretrained (Many Speakers, Alto, Tenor, Soprano, Bass)"
    config: configs/M4Singer.py
    checkpoint: checkpoints/M4Singer.ckpt
    readme: |
      This model is trained on the Opencpop and M4Singer datasets and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It contains more than 20 speakers and is thus a good playground for timbre mixing.
    default_speaker: "opencpop"

  - name: "Tohoku Kiritan (Feminine)"
    config: configs/Kiritan.py
    checkpoint: checkpoints/Kiritan.ckpt
    readme: |
      This model is trained on the Tohoku Kiritan dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a cute yet powerful voice.

      CV: Akaneya Himika
    default_speaker: "kiritan"

  - name: "Tohoku Itako (Feminine)"
    config: configs/Itako.py
    checkpoint: checkpoints/Itako.ckpt
    readme: |
      This model is trained on the Tohoku Itako dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a bright and whispery voice.

      CV: Kido Ibuki
    default_speaker: "itako"

  - name: "No.7 (Feminine)"
    config: configs/Seven.py
    checkpoint: checkpoints/Seven.ckpt
    readme: |
      This model is trained on the No.7 dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a strong and sharp voice.

      CV: Koiwai Kotori
    default_speaker: "seven"

  - name: "Yoko (Feminine)"
    config: configs/Yoko.py
    checkpoint: checkpoints/Yoko.ckpt
    readme: |
      This model is trained on the Sinsy-f00001 dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a powerful, tense, and relaxed voice.
    default_speaker: "yoko"

  - name: "JSUT (Feminine)"
    config: configs/JSUT.py
    checkpoint: checkpoints/JSUT.ckpt
    readme: |
      This model is trained on the JSUT-song dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a moist and transparent voice.
    default_speaker: "jsut"

  - name: "CSD (Feminine)"
    config: configs/CSD.py
    checkpoint: checkpoints/CSD.ckpt
    readme: |
      This model is trained on the Children's Song Dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a sweet and tender voice.
    default_speaker: "csd"

  - name: "Namine Ritsu (Feminine)"
    config: configs/Ritsu.py
    checkpoint: checkpoints/Ritsu.ckpt
    readme: |
      This model is trained on the Namine Ritsu ENUNU Dataset and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a powerful and throaty voice.

      CV: Canon
    default_speaker: "ritsu"

  - name: "S (Masculine)"
    config: configs/S.py
    checkpoint: checkpoints/S.ckpt
    readme: |
      This model is trained on a dataset known as S and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a nasal but powerful voice.
    default_speaker: "s"

  - name: "C (Feminine)"
    config: configs/C.py
    checkpoint: checkpoints/C.ckpt
    readme: |
      This model is trained on a dataset known as C and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a whispery, fluttery voice.
    default_speaker: "c"

  - name: "Azure Cobalt (Feminine)"
    config: configs/Azure.py
    checkpoint: checkpoints/Azure.ckpt
    readme: |
      This model is trained on a dataset known as Azure Cobalt and released under the [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/) license.

      It has a stable, mature voice.

      CV: Aster
    default_speaker: "azure"

  - name: "ALYS (Feminine)"
    config: configs/ALYS.py
    checkpoint: checkpoints/ALYS.ckpt
    readme: |
      This model is trained on the ALYS DB 001 JPN dataset, originally produced by Voxwave and released under the [GPL-3.0](https://choosealicense.com/licenses/gpl-3.0/) license.

      It has a slightly soft voice.

      CV: Poucet
    default_speaker: "ALYS"