ASR for low resource language with this model
#1
by
RichD
- opened
Is there a way to verify the ASR performance of this model with some example audio? How to restrict the output if the input audio language is known? For audio to text alignment, what text encoding should be used for the transcription? Thanks