Spaces:

RaivisDejus
/

LatvianSpeechRecognition

Sleeping

App Files Files Community

Raivis Dejus commited on May 2

Commit

adb4549

•

1 Parent(s): 00936c5

Adjusting notes

Browse files

Files changed (1) hide show

app.py +11 -18

app.py CHANGED Viewed

@@ -107,21 +107,16 @@ transcribe = gr.Interface(
         gr.Radio([("Transcribe", "transcribe"), ("Translate to English", "translate",)], label="Task", value="transcribe"),
     ],
     outputs=gr.Textbox(label="Transcription", lines=15),
-    title="Latvian speech recognition: Transcribe Audio",
-    description=("""
-        Test Latvian speech recognition (STT) models. Three models are available on this demo.
-        <h2>tiny</h2>
-        [RaivisDejus/whisper-tiny-lv](https://huggingface.co/RaivisDejus/whisper-tiny-lv) - Fastest, requiring least RAM, but also poor accuracy. On this demo hardware 30 second audio will take ~45 seconds to transcribe.
-        <h2>small</h2>
-        [RaivisDejus/whisper-small-lv](https://huggingface.co/RaivisDejus/whisper-small-lv) - Reasonably fast, reasonably accurate, requiring reasonable amounts of RAM. On this demo hardware 30 second audio will take ~1 minute to transcribe.
-        <h2>large</h2>
-        [AiLab-IMCS-UL/whisper-large-v3-lv-late-cv17](https://huggingface.co/AiLab-IMCS-UL/whisper-large-v3-lv-late-cv17) - Most accurate, developed by scientists from [ailab.lv](https://ailab.lv/). Requires most RAM and for best performance should be run on a GPU. On this demo hardware 30 second audio will take ~4 minutes to transcribe.
         To improve speech recognition quality, more data is needed, add your voice on [Balsu talka](https://balsutalka.lv/)
         """
@@ -141,13 +136,11 @@ yt_transcribe = gr.Interface(
     ],
     # outputs=["html", "text"],
     outputs=[gr.HTML(), gr.Textbox(label="Transcription", lines=10)],
-    title="Latvian speech recognition: Transcribe YouTube",
     description=("""
-        Test Latvian speech recognition (STT) models. Three models are available:
-        * [tiny](https://huggingface.co/RaivisDejus/whisper-tiny-lv) - Fastest, requiring least RAM, but also poor accuracy
-        * [small](https://huggingface.co/RaivisDejus/whisper-small-lv) - Reasonably fast, reasonably accurate, requiring reasonable amounts of RAM
         To improve speech recognition quality, more data is needed, add your voice on [Balsu talka](https://balsutalka.lv/)
         """

         gr.Radio([("Transcribe", "transcribe"), ("Translate to English", "translate",)], label="Task", value="transcribe"),
     ],
     outputs=gr.Textbox(label="Transcription", lines=15),
+    title="Latvian speech recognition: Three models available",
+    description=("""
+        🤖 [tiny](https://huggingface.co/RaivisDejus/whisper-tiny-lv) - Fastest, requiring least RAM, but also poor accuracy.
+        On this demo hardware 30 second audio will take ~45 seconds to transcribe.
+        🤖 [small](https://huggingface.co/RaivisDejus/whisper-small-lv) - Reasonably fast, reasonably accurate, requiring reasonable amounts of RAM.
+        On this demo hardware 30 second audio will take ~1 minute to transcribe.
+        🤖 [large](https://huggingface.co/AiLab-IMCS-UL/whisper-large-v3-lv-late-cv17) - Most accurate, developed by scientists from [ailab.lv](https://ailab.lv/). Requires most RAM and for best performance should be run on a GPU.
+        On this demo hardware 30 second audio will take ~4 minutes to transcribe.
         To improve speech recognition quality, more data is needed, add your voice on [Balsu talka](https://balsutalka.lv/)
         """
     ],
     # outputs=["html", "text"],
     outputs=[gr.HTML(), gr.Textbox(label="Transcription", lines=10)],
+    title="Latvian speech recognition: Two models available",
     description=("""
+        🤖 [tiny](https://huggingface.co/RaivisDejus/whisper-tiny-lv) - Fastest, requiring least RAM, but also poor accuracy
+        🤖 [small](https://huggingface.co/RaivisDejus/whisper-small-lv) - Reasonably fast, reasonably accurate, requiring reasonable amounts of RAM
         To improve speech recognition quality, more data is needed, add your voice on [Balsu talka](https://balsutalka.lv/)
         """