numpy<2.0.0 numba torch>=2.1.0 torchaudio tqdm vector_quantize_pytorch transformers>=4.41.1 vocos IPython gradio pybase16384 pynini==2.1.5; sys_platform == 'linux' WeTextProcessing; sys_platform == 'linux' nemo_text_processing; sys_platform == 'linux' av pydub fastapi[standard] requests uvicorn[standard]