---
language:
- zh
- tw
pipeline_tag: text-to-speech
base_model: Jackellie/ellie-Bert-VITS2
---

A Taiwanese-accent text-to-speech (TTS) model from JackEllie.

## Usage

Load this checkpoint with Hugging Face Transformers:

```python
from transformers import AutoModel, AutoProcessor
from scipy.io.wavfile import write
import torch

model_name = "BricksDisplay/ellie-Bert-VITS2"

# The checkpoint ships its own model and processor code, hence trust_remote_code=True.
model = AutoModel.from_pretrained(model_name, trust_remote_code=True)
processor = AutoProcessor.from_pretrained(model_name, trust_remote_code=True)

# Run inference without tracking gradients.
with torch.no_grad():
    inputs = processor("你好", language="zh", return_tensors="pt")
    result = model(**inputs)
    result = result["waveform"]

# Save the first waveform in the batch as a WAV file.
write("output.wav", model.config.sampling_rate, result[0].numpy())
```
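
`scipy.io.wavfile.write` stores a floating-point array as a floating-point WAV file, which some audio players may not open. If that is a problem, the waveform can be converted to 16-bit PCM before saving. The helper below is a minimal sketch, not part of the checkpoint: `to_int16_pcm` is a hypothetical name, and it assumes the model returns float samples nominally in the range [-1.0, 1.0].

```python
import numpy as np


def to_int16_pcm(waveform):
    """Convert a float waveform to 16-bit PCM samples (hypothetical helper)."""
    # Assumption: samples are floats nominally in [-1.0, 1.0].
    # Values outside that range are clipped to avoid integer wrap-around.
    clipped = np.clip(np.asarray(waveform, dtype=np.float32), -1.0, 1.0)
    # Scale to the int16 range; astype truncates toward zero.
    return (clipped * 32767.0).astype(np.int16)
```

With the variables from the usage example above, the call would look like `write("output_int16.wav", model.config.sampling_rate, to_int16_pcm(result[0].numpy()))`.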