Model fix

Files changed (5) hide show

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ metrics:
 pipeline_tag: automatic-speech-recognition
 ---
-# Latvian Whisper tiny speech recognition model
 Trained on combination of:
 - Common Voice 17, custom selection of all validated clips, max 1000 clips per speaker
@@ -19,4 +19,4 @@ Trained on combination of:
 Both regular whisper model and CTranslate2 converted version for use with [faster-whisper](https://github.com/SYSTRAN/faster-whisper) as part of [Home Assistant Whisper integration](https://www.home-assistant.io/integrations/whisper/) are available.
-Speech recognition quality is poor, more data is needed, donate your voice on [Balsu talka](https://balsutalka.lv/)

 pipeline_tag: automatic-speech-recognition
 ---
+# Latvian Whisper small speech recognition model
 Trained on combination of:
 - Common Voice 17, custom selection of all validated clips, max 1000 clips per speaker
 Both regular whisper model and CTranslate2 converted version for use with [faster-whisper](https://github.com/SYSTRAN/faster-whisper) as part of [Home Assistant Whisper integration](https://www.home-assistant.io/integrations/whisper/) are available.
+To improve speech recognition quality, more data is needed, donate your voice on [Balsu talka](https://balsutalka.lv/)

config.json CHANGED Viewed

@@ -49,7 +49,7 @@
   "use_cache": true,
   "use_weighted_layer_sum": false,
   "vocab_size": 51865,
-    "alignment_heads": [
     [
       5,
       3

   "use_cache": true,
   "use_weighted_layer_sum": false,
   "vocab_size": 51865,
+  "alignment_heads": [
     [
       5,
       3

convert-to-safetensors.py → fix-model-metadata.py RENAMED Viewed

@@ -1,15 +1,9 @@
-# pip install git+https://github.com/openai/whisper.git
-# pip install safetensors
-import whisper
-import safetensors.torch
-model = whisper.load_model("small")
-safetensors.torch.save_model(model, "model.safetensors")
 tensors = dict()
 with safetensors.safe_open("./model.safetensors", framework="pt") as f:
     for key in f.keys():
         tensors[key] = f.get_tensor(key)
-safetensors.torch.save_file(tensors, "./model.safetensors", metadata={'format': 'pt'})

+import safetensors
+from safetensors.torch import save_file
 tensors = dict()
 with safetensors.safe_open("./model.safetensors", framework="pt") as f:
     for key in f.keys():
         tensors[key] = f.get_tensor(key)
+save_file(tensors, "./model.safetensors", metadata={'format': 'pt'})

model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:87b4355275c56e5bdc097320ac7c0131d88058057851a792eae0109da45d26dd
-size 482976402

 version https://git-lfs.github.com/spec/v1
+oid sha256:df9ddf80e8488036fe50bcbabd56affd4e6159223faa4c197aaf3553ccfcb376
+size 483547016

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60c79be00a6148f08f8147384322114b3fefe51f22d2e39af30ea8eae83dffa7
-size 966989264

 version https://git-lfs.github.com/spec/v1
+oid sha256:623d0cc8bde553c1a83f7165fdc927c3f03dacd73ee565506d341c84a47b6d7a
+size 966995080