File size: 2,043 Bytes
c20b6de
 
1f211a9
c20b6de
 
 
1040275
c20b6de
 
 
 
22529f3
1f211a9
 
22529f3
14485b0
22529f3
0b43409
1f211a9
 
 
 
9da9a4e
1f211a9
 
 
72475af
 
 
 
 
1f211a9
 
fee695e
14485b0
 
 
fee695e
4a80c25
e2f1242
1f211a9
e2f1242
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
---
title: "Ukrainian Speech-to-Text"
emoji: 🐌
colorFrom: blue
colorTo: yellow
sdk: gradio
sdk_version: 3.41.2
app_file: app.py
pinned: false
---

# 🇺🇦🎤 Voice recognition for Ukrainian language
This is a repository with aim to apply various speech recognition models on Ukrainian language.  

You can see online demo here: https://huggingface.co/spaces/robinhad/ukrainian-stt.  
Github link: https://github.com/robinhad/voice-recognition-ua.  
Source code is in this repository together with auto-deploy pipeline scripts. 


# 🧮 Models
Model name  |  CER  |  WER  | License | Note
:-------------------------|:-------------------------|:-------------------------|:-------------------------|:-------------------------
[Wav2Vec2](https://github.com/robinhad/voice-recognition-ua/releases/tag/release%2Fwav2vec2-v0.1) | 6,01% | 27,99% | MIT | Common Voice 8 dataset, `test` set used as validation
[DeepSpeech with Wiki LM](https://github.com/robinhad/voice-recognition-ua/releases/tag/v0.4) | 12% | 30,65% | CC-BY-NC 4.0 | Common Voice 6 dataset
[DeepSpeech](https://github.com/robinhad/voice-recognition-ua/releases/tag/v0.4) | 16% | 57% | CC-BY-NC 4.0 | Common Voice 6 dataset


Checkout latest releases here: https://github.com/robinhad/voice-recognition-ua/releases/.

If you'd like to check out different models for Ukrainian language, please visit https://github.com/egorsmkv/speech-recognition-uk.

# 🤖 Training scripts
Guides for training are available in corresponding folders for each model.

# Support
If you like my work, please support here: https://send.monobank.ua/jar/48iHq4xAXm

# 🤝 Attribution
[@robinhad](https://github.com/robinhad) - model training.  
[@egorsmkv](https://github.com/egorsmkv) - organized [Ukrainian Speech recognition community](https://github.com/egorsmkv/speech-recognition-uk).  
[@tarasfrompir](https://github.com/tarasfrompir) - created synthetic 1200h Ukrainian Speech-to-Text dataset.  
[@AlexeyBoiler](https://github.com/AlexeyBoiler) - hosted Ukrainian Speech-to-Text dataset.