FILM6912/Whisper-small-thai

Automatic Speech Recognition

Generated from Trainer

FILM6912 committed on Aug 3

Commit 0b5ef14 • 1 Parent(s): d87efdf

Update README.md

Files changed (1)

README.md +22 -1

@@ -37,7 +37,28 @@ This model is a fine-tuned version of [biodatlab/whisper-th-small-combined](http
 ## Model description
-More information needed
 ## Intended uses & limitations

 ## Model description
+Use the model with Hugging Face's `transformers` library as follows:
+```py
+import torch  # needed for the torch.cuda.is_available() check below
+from transformers import pipeline
+MODEL_NAME = "FILM6912/Whisper-small-thai"  # specify the model name
+lang = "th"  # set the transcription language to Thai
+device = 0 if torch.cuda.is_available() else "cpu"
+pipe = pipeline(
+    task="automatic-speech-recognition",
+    model=MODEL_NAME,
+    chunk_length_s=30,
+    device=device,
+)
+pipe.model.config.forced_decoder_ids = pipe.tokenizer.get_decoder_prompt_ids(
+  language=lang,
+  task="transcribe"
+)
+text = pipe("audio.mp3")["text"]  # transcribe an audio file (MP3 here) to text
+```
 ## Intended uses & limitations
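
The snippet added in this commit passes `chunk_length_s=30`, which asks the pipeline to process long recordings in overlapping 30-second windows rather than all at once. The sketch below is only a simplified illustration of that idea in plain Python, not the `transformers` implementation. The helper name `chunk_windows` is hypothetical, and the per-side overlap of `chunk_length_s / 6` is an assumption based on the documented default of `stride_length_s`.

```python
def chunk_windows(duration_s, chunk_length_s=30.0, stride_s=None):
    """Return (start, end) windows in seconds that cover the audio with overlap.

    Hypothetical helper for illustration only; the real pipeline also trims
    the overlapping regions when it merges the per-chunk transcriptions.
    """
    if stride_s is None:
        # Assumed default: overlap of one sixth of the chunk on each side.
        stride_s = chunk_length_s / 6
    # Each new window starts this far after the previous one.
    step = chunk_length_s - 2 * stride_s
    if step <= 0:
        raise ValueError("stride_s must be less than half of chunk_length_s")

    windows = []
    start = 0.0
    while True:
        end = min(start + chunk_length_s, duration_s)
        windows.append((start, end))
        if end >= duration_s:
            break  # the last window reaches the end of the audio
        start += step
    return windows


if __name__ == "__main__":
    # A 70-second recording is covered by three overlapping 30-second windows.
    for start, end in chunk_windows(70.0):
        print(f"{start:5.1f}s -> {end:5.1f}s")
```

Under these assumptions, consecutive windows share 10 seconds of audio, which gives the model context on both sides of each chunk boundary.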