litagin commited on
Commit
06cac5c
β€’
1 Parent(s): cf61580
Files changed (2) hide show
  1. app.py +9 -2
  2. requirements.txt +1 -0
app.py CHANGED
@@ -200,7 +200,12 @@ def tts(
200
  edge_output_filename,
201
  (tgt_sr, audio_opt),
202
  )
203
-
 
 
 
 
 
204
  except:
205
  info = traceback.format_exc()
206
  print(info)
@@ -214,11 +219,13 @@ This is a text-to-speech demo of RVC moe models of [rvc_okiba](https://huggingfa
214
 
215
  Input text ➑[(edge-tts)](https://github.com/rany2/edge-tts)➑ Speech mp3 file ➑[(RVC)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)➑ Final output
216
 
 
 
217
  Although the models are trained on Japanese voices and intended for Japanese text, they can also be used with other languages with the corresponding edge-tts speaker (but possibly with a Japanese accent).
218
 
219
  Input characters are limited to 280 characters, and the speech audio is limited to 20 seconds in this πŸ€— space.
220
 
221
- [Visit this GitHub repo](https://github.com/litagin02/rvc-tts-webui) for running locally with your models!
222
  """
223
 
224
  app = gr.Blocks()
 
200
  edge_output_filename,
201
  (tgt_sr, audio_opt),
202
  )
203
+ except EOFError:
204
+ info = """
205
+ It seems that edge-tts output is empty. This may occur when the input text and the speaker do not match.
206
+ For example, maybe you entered Japanese (without alphabets) text but chose non-Japanese speaker?"""
207
+ print(info)
208
+ return info, None, None
209
  except:
210
  info = traceback.format_exc()
211
  print(info)
 
219
 
220
  Input text ➑[(edge-tts)](https://github.com/rany2/edge-tts)➑ Speech mp3 file ➑[(RVC)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)➑ Final output
221
 
222
+ This runs on the πŸ€— server's cpu, so it may be slow.
223
+
224
  Although the models are trained on Japanese voices and intended for Japanese text, they can also be used with other languages with the corresponding edge-tts speaker (but possibly with a Japanese accent).
225
 
226
  Input characters are limited to 280 characters, and the speech audio is limited to 20 seconds in this πŸ€— space.
227
 
228
+ [Visit this GitHub repo](https://github.com/litagin02/rvc-tts-webui) for running locally with your models and GPU!
229
  """
230
 
231
  app = gr.Blocks()
requirements.txt CHANGED
@@ -2,6 +2,7 @@ edge_tts==6.1.7
2
  fairseq==0.12.2
3
  faiss_cpu==1.7.4
4
  gradio==3.36.1
 
5
  librosa==0.9.1
6
  numpy==1.23.5
7
  praat-parselmouth==0.4.3
 
2
  fairseq==0.12.2
3
  faiss_cpu==1.7.4
4
  gradio==3.36.1
5
+ gradio_client==0.2.9
6
  librosa==0.9.1
7
  numpy==1.23.5
8
  praat-parselmouth==0.4.3