update readme for 2 step trainings

Browse files

Files changed (7) hide show

README.md +14 -3
README_inital_step.md +76 -0
config.json +1 -1
inference.ipynb +22 -48
pytorch_model.bin +1 -1
train_kh.ipynb +184 -550
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -20,8 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the openslr dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4638
-- Wer: 0.4944
 ## Model description
@@ -48,7 +48,7 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 1000
 - num_epochs: 50
 - mixed_precision_training: Native AMP
@@ -66,6 +66,17 @@ The following hyperparameters were used during training:
 | 1.4696        | 39.5  | 3200 | 0.5002          | 0.5130 |
 | 1.4175        | 44.44 | 3600 | 0.4752          | 0.5021 |
 | 1.3943        | 49.38 | 4000 | 0.4638          | 0.4944 |
 ### Framework versions

 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the openslr dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3142
+- Wer: 0.3512
 ## Model description
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
 - num_epochs: 50
 - mixed_precision_training: Native AMP
 | 1.4696        | 39.5  | 3200 | 0.5002          | 0.5130 |
 | 1.4175        | 44.44 | 3600 | 0.4752          | 0.5021 |
 | 1.3943        | 49.38 | 4000 | 0.4638          | 0.4944 |
+| Pause and Resume |    |      |                 |        |
+| 1.3829        | 4.93  | 400  | 0.4290          | 0.4796 |
+| 1.3156        | 9.87  | 800  | 0.3856          | 0.4474 |
+| 1.2396        | 14.81 | 1200 | 0.3600          | 0.4307 |
+| 1.1444        | 19.75 | 1600 | 0.3423          | 0.4179 |
+| 1.0979        | 24.69 | 2000 | 0.3370          | 0.3884 |
+| 1.0714        | 29.62 | 2400 | 0.3237          | 0.3710 |
+| 1.0442        | 34.56 | 2800 | 0.3336          | 0.3683 |
+| 1.0492        | 39.5  | 3200 | 0.3166          | 0.3527 |
+| 1.0284        | 44.44 | 3600 | 0.3178          | 0.3566 |
+| 1.0302        | 49.38 | 4000 | 0.3142          | 0.3512 |
 ### Framework versions

README_inital_step.md ADDED Viewed

	@@ -0,0 +1,76 @@

+---
+language:
+- km
+license: apache-2.0
+tags:
+- automatic-speech-recognition
+- openslr
+- robust-speech-event
+- km
+- generated_from_trainer
+model-index:
+- name: ''
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+#
+This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the openslr dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4638
+- Wer: 0.4944
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 32
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 1000
+- num_epochs: 50
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer    |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| 5.2049        | 4.93  | 400  | 4.5570          | 1.0    |
+| 3.569         | 9.87  | 800  | 3.5415          | 1.0    |
+| 3.483         | 14.81 | 1200 | 3.3956          | 1.0    |
+| 2.1906        | 19.75 | 1600 | 1.1732          | 0.7897 |
+| 1.7968        | 24.69 | 2000 | 0.7634          | 0.6678 |
+| 1.615         | 29.62 | 2400 | 0.6182          | 0.5922 |
+| 1.52          | 34.56 | 2800 | 0.5473          | 0.5479 |
+| 1.4696        | 39.5  | 3200 | 0.5002          | 0.5130 |
+| 1.4175        | 44.44 | 3600 | 0.4752          | 0.5021 |
+| 1.3943        | 49.38 | 4000 | 0.4638          | 0.4944 |
+### Framework versions
+- Transformers 4.17.0.dev0
+- Pytorch 1.10.2+cu102
+- Datasets 1.18.2.dev0
+- Tokenizers 0.11.0

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "facebook/wav2vec2-xls-r-300m",
   "activation_dropout": 0.0,
   "adapter_kernel_size": 3,
   "adapter_stride": 2,

 {
+  "_name_or_path": "checkpoint-4000",
   "activation_dropout": 0.0,
   "adapter_kernel_size": 3,
   "adapter_stride": 2,

inference.ipynb CHANGED Viewed

@@ -3,7 +3,7 @@
   {
    "cell_type": "code",
    "execution_count": 1,
-   "id": "438927ca",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -16,46 +16,20 @@
   {
    "cell_type": "code",
    "execution_count": 5,
-   "id": "27a57965",
    "metadata": {},
    "outputs": [],
    "source": [
-    "model = AutoModelForCTC.from_pretrained(\".\").to('cuda')\n",
-    "processor = Wav2Vec2Processor.from_pretrained(\".\")"
    ]
   },
   {
    "cell_type": "code",
-   "execution_count": 3,
-   "id": "1d4324df",
-   "metadata": {
-    "collapsed": true,
-    "jupyter": {
-     "outputs_hidden": true
-    }
-   },
-   "outputs": [
-    {
-     "ename": "JSONDecodeError",
-     "evalue": "Expecting value: line 1 column 1 (char 0)",
-     "output_type": "error",
-     "traceback": [
-      "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
-      "\u001b[0;31mJSONDecodeError\u001b[0m                           Traceback (most recent call last)",
-      "Input \u001b[0;32mIn [3]\u001b[0m, in \u001b[0;36m<module>\u001b[0;34m\u001b[0m\n\u001b[1;32m      1\u001b[0m model \u001b[38;5;241m=\u001b[39m AutoModelForCTC\u001b[38;5;241m.\u001b[39mfrom_pretrained(\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mvitouphy/xls-r-300m-km\u001b[39m\u001b[38;5;124m\"\u001b[39m)\u001b[38;5;241m.\u001b[39mto(\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mcuda\u001b[39m\u001b[38;5;124m'\u001b[39m)\n\u001b[0;32m----> 2\u001b[0m processor \u001b[38;5;241m=\u001b[39m \u001b[43mWav2Vec2Processor\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mfrom_pretrained\u001b[49m\u001b[43m(\u001b[49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[38;5;124;43mvitouphy/xls-r-300m-km\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[43m)\u001b[49m\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/models/wav2vec2/processing_wav2vec2.py:117\u001b[0m, in \u001b[0;36mWav2Vec2Processor.from_pretrained\u001b[0;34m(cls, pretrained_model_name_or_path, **kwargs)\u001b[0m\n\u001b[1;32m    112\u001b[0m \u001b[38;5;66;03m# load generic `AutoTokenizer`\u001b[39;00m\n\u001b[1;32m    113\u001b[0m \u001b[38;5;66;03m# need fallback here for backward compatibility in case processor is\u001b[39;00m\n\u001b[1;32m    114\u001b[0m \u001b[38;5;66;03m# loaded from just a tokenizer file that does not have a `tokenizer_class` attribute\u001b[39;00m\n\u001b[1;32m    115\u001b[0m \u001b[38;5;66;03m# behavior should be deprecated in major future release\u001b[39;00m\n\u001b[1;32m    116\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[0;32m--> 117\u001b[0m     tokenizer \u001b[38;5;241m=\u001b[39m \u001b[43mAutoTokenizer\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mfrom_pretrained\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpretrained_model_name_or_path\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    118\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mOSError\u001b[39;00m:\n\u001b[1;32m    119\u001b[0m     warnings\u001b[38;5;241m.\u001b[39mwarn(\n\u001b[1;32m    120\u001b[0m         \u001b[38;5;124mf\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mLoading a tokenizer inside \u001b[39m\u001b[38;5;132;01m{\u001b[39;00m\u001b[38;5;28mcls\u001b[39m\u001b[38;5;241m.\u001b[39m\u001b[38;5;18m__name__\u001b[39m\u001b[38;5;132;01m}\u001b[39;00m\u001b[38;5;124m from a config that does not\u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m    121\u001b[0m         \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124m include a `tokenizer_class` attribute is deprecated and will be \u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[0;32m   (...)\u001b[0m\n\u001b[1;32m    125\u001b[0m         \u001b[38;5;167;01mFutureWarning\u001b[39;00m,\n\u001b[1;32m    126\u001b[0m     )\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/models/auto/tokenization_auto.py:514\u001b[0m, in \u001b[0;36mAutoTokenizer.from_pretrained\u001b[0;34m(cls, pretrained_model_name_or_path, *inputs, **kwargs)\u001b[0m\n\u001b[1;32m    510\u001b[0m     \u001b[38;5;28;01mif\u001b[39;00m tokenizer_class \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[1;32m    511\u001b[0m         \u001b[38;5;28;01mraise\u001b[39;00m \u001b[38;5;167;01mValueError\u001b[39;00m(\n\u001b[1;32m    512\u001b[0m             \u001b[38;5;124mf\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mTokenizer class \u001b[39m\u001b[38;5;132;01m{\u001b[39;00mtokenizer_class_candidate\u001b[38;5;132;01m}\u001b[39;00m\u001b[38;5;124m does not exist or is not currently imported.\u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m    513\u001b[0m         )\n\u001b[0;32m--> 514\u001b[0m     \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43mtokenizer_class\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mfrom_pretrained\u001b[49m\u001b[43m(\u001b[49m\u001b[43mpretrained_model_name_or_path\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43minputs\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    516\u001b[0m \u001b[38;5;66;03m# Otherwise we have to be creative.\u001b[39;00m\n\u001b[1;32m    517\u001b[0m \u001b[38;5;66;03m# if model is an encoder decoder, the encoder tokenizer class is used by default\u001b[39;00m\n\u001b[1;32m    518\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28misinstance\u001b[39m(config, EncoderDecoderConfig):\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/tokenization_utils_base.py:1773\u001b[0m, in \u001b[0;36mPreTrainedTokenizerBase.from_pretrained\u001b[0;34m(cls, pretrained_model_name_or_path, *init_inputs, **kwargs)\u001b[0m\n\u001b[1;32m   1770\u001b[0m     \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[1;32m   1771\u001b[0m         logger\u001b[38;5;241m.\u001b[39minfo(\u001b[38;5;124mf\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mloading file \u001b[39m\u001b[38;5;132;01m{\u001b[39;00mfile_path\u001b[38;5;132;01m}\u001b[39;00m\u001b[38;5;124m from cache at \u001b[39m\u001b[38;5;132;01m{\u001b[39;00mresolved_vocab_files[file_id]\u001b[38;5;132;01m}\u001b[39;00m\u001b[38;5;124m\"\u001b[39m)\n\u001b[0;32m-> 1773\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[38;5;28;43mcls\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_from_pretrained\u001b[49m\u001b[43m(\u001b[49m\n\u001b[1;32m   1774\u001b[0m \u001b[43m    \u001b[49m\u001b[43mresolved_vocab_files\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1775\u001b[0m \u001b[43m    \u001b[49m\u001b[43mpretrained_model_name_or_path\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1776\u001b[0m \u001b[43m    \u001b[49m\u001b[43minit_configuration\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1777\u001b[0m \u001b[43m    \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43minit_inputs\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1778\u001b[0m \u001b[43m    \u001b[49m\u001b[43muse_auth_token\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43muse_auth_token\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1779\u001b[0m \u001b[43m    \u001b[49m\u001b[43mcache_dir\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mcache_dir\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1780\u001b[0m \u001b[43m    \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1781\u001b[0m \u001b[43m\u001b[49m\u001b[43m)\u001b[49m\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/tokenization_utils_base.py:1908\u001b[0m, in \u001b[0;36mPreTrainedTokenizerBase._from_pretrained\u001b[0;34m(cls, resolved_vocab_files, pretrained_model_name_or_path, init_configuration, use_auth_token, cache_dir, *init_inputs, **kwargs)\u001b[0m\n\u001b[1;32m   1906\u001b[0m \u001b[38;5;66;03m# Instantiate tokenizer.\u001b[39;00m\n\u001b[1;32m   1907\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[0;32m-> 1908\u001b[0m     tokenizer \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mcls\u001b[39;49m\u001b[43m(\u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43minit_inputs\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43minit_kwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m   1909\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mOSError\u001b[39;00m:\n\u001b[1;32m   1910\u001b[0m     \u001b[38;5;28;01mraise\u001b[39;00m \u001b[38;5;167;01mOSError\u001b[39;00m(\n\u001b[1;32m   1911\u001b[0m         \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mUnable to load vocabulary from file. \u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m   1912\u001b[0m         \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mPlease check that the provided vocabulary is accessible and not corrupted.\u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m   1913\u001b[0m     )\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/models/wav2vec2/tokenization_wav2vec2.py:142\u001b[0m, in \u001b[0;36mWav2Vec2CTCTokenizer.__init__\u001b[0;34m(self, vocab_file, bos_token, eos_token, unk_token, pad_token, word_delimiter_token, do_lower_case, **kwargs)\u001b[0m\n\u001b[1;32m    139\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mdo_lower_case \u001b[38;5;241m=\u001b[39m do_lower_case\n\u001b[1;32m    141\u001b[0m \u001b[38;5;28;01mwith\u001b[39;00m \u001b[38;5;28mopen\u001b[39m(vocab_file, encoding\u001b[38;5;241m=\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mutf-8\u001b[39m\u001b[38;5;124m\"\u001b[39m) \u001b[38;5;28;01mas\u001b[39;00m vocab_handle:\n\u001b[0;32m--> 142\u001b[0m     \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mencoder \u001b[38;5;241m=\u001b[39m \u001b[43mjson\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mload\u001b[49m\u001b[43m(\u001b[49m\u001b[43mvocab_handle\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    143\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mdecoder \u001b[38;5;241m=\u001b[39m {v: k \u001b[38;5;28;01mfor\u001b[39;00m k, v \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mencoder\u001b[38;5;241m.\u001b[39mitems()}\n\u001b[1;32m    145\u001b[0m \u001b[38;5;66;03m# make sure that tokens made of several\u001b[39;00m\n\u001b[1;32m    146\u001b[0m \u001b[38;5;66;03m# characters are not split at tokenization\u001b[39;00m\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/__init__.py:293\u001b[0m, in \u001b[0;36mload\u001b[0;34m(fp, cls, object_hook, parse_float, parse_int, parse_constant, object_pairs_hook, **kw)\u001b[0m\n\u001b[1;32m    274\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21mload\u001b[39m(fp, \u001b[38;5;241m*\u001b[39m, \u001b[38;5;28mcls\u001b[39m\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, object_hook\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, parse_float\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m,\n\u001b[1;32m    275\u001b[0m         parse_int\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, parse_constant\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, object_pairs_hook\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, \u001b[38;5;241m*\u001b[39m\u001b[38;5;241m*\u001b[39mkw):\n\u001b[1;32m    276\u001b[0m     \u001b[38;5;124;03m\"\"\"Deserialize ``fp`` (a ``.read()``-supporting file-like object containing\u001b[39;00m\n\u001b[1;32m    277\u001b[0m \u001b[38;5;124;03m    a JSON document) to a Python object.\u001b[39;00m\n\u001b[1;32m    278\u001b[0m \n\u001b[0;32m   (...)\u001b[0m\n\u001b[1;32m    291\u001b[0m \u001b[38;5;124;03m    kwarg; otherwise ``JSONDecoder`` is used.\u001b[39;00m\n\u001b[1;32m    292\u001b[0m \u001b[38;5;124;03m    \"\"\"\u001b[39;00m\n\u001b[0;32m--> 293\u001b[0m     \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43mloads\u001b[49m\u001b[43m(\u001b[49m\u001b[43mfp\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mread\u001b[49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m    294\u001b[0m \u001b[43m        \u001b[49m\u001b[38;5;28;43mcls\u001b[39;49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;28;43mcls\u001b[39;49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mobject_hook\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mobject_hook\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m    295\u001b[0m \u001b[43m        \u001b[49m\u001b[43mparse_float\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mparse_float\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mparse_int\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mparse_int\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m    296\u001b[0m \u001b[43m        \u001b[49m\u001b[43mparse_constant\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mparse_constant\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mobject_pairs_hook\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mobject_pairs_hook\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkw\u001b[49m\u001b[43m)\u001b[49m\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/__init__.py:357\u001b[0m, in \u001b[0;36mloads\u001b[0;34m(s, cls, object_hook, parse_float, parse_int, parse_constant, object_pairs_hook, **kw)\u001b[0m\n\u001b[1;32m    352\u001b[0m     \u001b[38;5;28;01mdel\u001b[39;00m kw[\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mencoding\u001b[39m\u001b[38;5;124m'\u001b[39m]\n\u001b[1;32m    354\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m (\u001b[38;5;28mcls\u001b[39m \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m object_hook \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m\n\u001b[1;32m    355\u001b[0m         parse_int \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m parse_float \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m\n\u001b[1;32m    356\u001b[0m         parse_constant \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m object_pairs_hook \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m \u001b[38;5;129;01mnot\u001b[39;00m kw):\n\u001b[0;32m--> 357\u001b[0m     \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43m_default_decoder\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mdecode\u001b[49m\u001b[43m(\u001b[49m\u001b[43ms\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    358\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mcls\u001b[39m \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[1;32m    359\u001b[0m     \u001b[38;5;28mcls\u001b[39m \u001b[38;5;241m=\u001b[39m JSONDecoder\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/decoder.py:337\u001b[0m, in \u001b[0;36mJSONDecoder.decode\u001b[0;34m(self, s, _w)\u001b[0m\n\u001b[1;32m    332\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21mdecode\u001b[39m(\u001b[38;5;28mself\u001b[39m, s, _w\u001b[38;5;241m=\u001b[39mWHITESPACE\u001b[38;5;241m.\u001b[39mmatch):\n\u001b[1;32m    333\u001b[0m     \u001b[38;5;124;03m\"\"\"Return the Python representation of ``s`` (a ``str`` instance\u001b[39;00m\n\u001b[1;32m    334\u001b[0m \u001b[38;5;124;03m    containing a JSON document).\u001b[39;00m\n\u001b[1;32m    335\u001b[0m \n\u001b[1;32m    336\u001b[0m \u001b[38;5;124;03m    \"\"\"\u001b[39;00m\n\u001b[0;32m--> 337\u001b[0m     obj, end \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mraw_decode\u001b[49m\u001b[43m(\u001b[49m\u001b[43ms\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43midx\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43m_w\u001b[49m\u001b[43m(\u001b[49m\u001b[43ms\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m0\u001b[39;49m\u001b[43m)\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mend\u001b[49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    338\u001b[0m     end \u001b[38;5;241m=\u001b[39m _w(s, end)\u001b[38;5;241m.\u001b[39mend()\n\u001b[1;32m    339\u001b[0m     \u001b[38;5;28;01mif\u001b[39;00m end \u001b[38;5;241m!=\u001b[39m \u001b[38;5;28mlen\u001b[39m(s):\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/decoder.py:355\u001b[0m, in \u001b[0;36mJSONDecoder.raw_decode\u001b[0;34m(self, s, idx)\u001b[0m\n\u001b[1;32m    353\u001b[0m     obj, end \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mscan_once(s, idx)\n\u001b[1;32m    354\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mStopIteration\u001b[39;00m \u001b[38;5;28;01mas\u001b[39;00m err:\n\u001b[0;32m--> 355\u001b[0m     \u001b[38;5;28;01mraise\u001b[39;00m JSONDecodeError(\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mExpecting value\u001b[39m\u001b[38;5;124m\"\u001b[39m, s, err\u001b[38;5;241m.\u001b[39mvalue) \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;28mNone\u001b[39m\n\u001b[1;32m    356\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m obj, end\n",
-      "\u001b[0;31mJSONDecodeError\u001b[0m: Expecting value: line 1 column 1 (char 0)"
-     ]
-    }
-   ],
    "source": [
     "model = AutoModelForCTC.from_pretrained(\"vitouphy/xls-r-300m-km\").to('cuda')\n",
     "processor = Wav2Vec2Processor.from_pretrained(\"vitouphy/xls-r-300m-km\")"
@@ -64,7 +38,7 @@
   {
    "cell_type": "code",
    "execution_count": 8,
-   "id": "3d61ff3b",
    "metadata": {},
    "outputs": [
     {
@@ -83,7 +57,7 @@
   {
    "cell_type": "code",
    "execution_count": 9,
-   "id": "a03f3af4",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -95,7 +69,7 @@
   {
    "cell_type": "code",
    "execution_count": 10,
-   "id": "9c88048b",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -105,7 +79,7 @@
   {
    "cell_type": "code",
    "execution_count": 11,
-   "id": "f3bfc930",
    "metadata": {},
    "outputs": [
     {
@@ -130,7 +104,7 @@
   {
    "cell_type": "code",
    "execution_count": 12,
-   "id": "122a898b",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -149,7 +123,7 @@
   {
    "cell_type": "code",
    "execution_count": 13,
-   "id": "153e7f45",
    "metadata": {},
    "outputs": [
     {
@@ -173,8 +147,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 17,
-   "id": "8947d307",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -183,8 +157,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 18,
-   "id": "3d6b46ca",
    "metadata": {},
    "outputs": [
     {
@@ -203,8 +177,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 19,
-   "id": "d1550ddc",
    "metadata": {},
    "outputs": [
     {
@@ -232,7 +206,7 @@
   {
    "cell_type": "code",
    "execution_count": null,
-   "id": "5bbf1c82",
    "metadata": {},
    "outputs": [],
    "source": []
@@ -240,7 +214,7 @@
   {
    "cell_type": "code",
    "execution_count": null,
-   "id": "71b6f502",
    "metadata": {},
    "outputs": [],
    "source": []

   {
    "cell_type": "code",
    "execution_count": 1,
+   "id": "310fea8f",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 5,
+   "id": "555c8316",
    "metadata": {},
    "outputs": [],
    "source": [
+    "# model = AutoModelForCTC.from_pretrained(\".\").to('cuda')\n",
+    "# processor = Wav2Vec2Processor.from_pretrained(\".\")"
    ]
   },
   {
    "cell_type": "code",
+   "execution_count": 20,
+   "id": "24cc91e8",
+   "metadata": {},
+   "outputs": [],
    "source": [
     "model = AutoModelForCTC.from_pretrained(\"vitouphy/xls-r-300m-km\").to('cuda')\n",
     "processor = Wav2Vec2Processor.from_pretrained(\"vitouphy/xls-r-300m-km\")"
   {
    "cell_type": "code",
    "execution_count": 8,
+   "id": "69d79b00",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 9,
+   "id": "9c9a59b3",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 10,
+   "id": "868afb48",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 11,
+   "id": "f93e7f2a",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 12,
+   "id": "c97bf6c8",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 13,
+   "id": "8e6b77e3",
    "metadata": {},
    "outputs": [
     {
   },
   {
    "cell_type": "code",
+   "execution_count": 21,
+   "id": "53b5be56",
    "metadata": {},
    "outputs": [],
    "source": [
   },
   {
    "cell_type": "code",
+   "execution_count": 22,
+   "id": "15dda9d3",
    "metadata": {},
    "outputs": [
     {
   },
   {
    "cell_type": "code",
+   "execution_count": 23,
+   "id": "bc40d9dc",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": null,
+   "id": "f755f572",
    "metadata": {},
    "outputs": [],
    "source": []
   {
    "cell_type": "code",
    "execution_count": null,
+   "id": "16aa56dc",
    "metadata": {},
    "outputs": [],
    "source": []

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d0ba956e9839aa07016519fdfa526ee53a8b6a22d5f2e4b17268e9859f128846
 size 1262231153

 version https://git-lfs.github.com/spec/v1
+oid sha256:1b254f3cb4af80138f33d06456c61d7e3730f18b51646fbede34871daeafcc7e
 size 1262231153

train_kh.ipynb CHANGED Viewed

@@ -3,7 +3,7 @@
   {
    "cell_type": "code",
    "execution_count": 1,
-   "id": "1bf32ef8",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -16,7 +16,7 @@
   {
    "cell_type": "code",
    "execution_count": null,
-   "id": "d2deec6c",
    "metadata": {
     "collapsed": true,
     "jupyter": {
@@ -19167,7 +19167,7 @@
   },
   {
    "cell_type": "markdown",
-   "id": "6fe38e7a",
    "metadata": {},
    "source": [
     "### Load KH Data"
@@ -19176,7 +19176,7 @@
   {
    "cell_type": "code",
    "execution_count": 4,
-   "id": "b75f1fec",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19197,7 +19197,7 @@
   {
    "cell_type": "code",
    "execution_count": 5,
-   "id": "433fe749",
    "metadata": {},
    "outputs": [
     {
@@ -19307,7 +19307,7 @@
   {
    "cell_type": "code",
    "execution_count": 6,
-   "id": "c6d633ad",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19321,7 +19321,7 @@
   },
   {
    "cell_type": "markdown",
-   "id": "acb914d0",
    "metadata": {},
    "source": [
     "### Clean Up the Text"
@@ -19330,7 +19330,7 @@
   {
    "cell_type": "code",
    "execution_count": 6,
-   "id": "bc3a017b",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19346,7 +19346,7 @@
   {
    "cell_type": "code",
    "execution_count": 7,
-   "id": "4a7b6a10",
    "metadata": {
     "collapsed": true,
     "jupyter": {
@@ -19402,7 +19402,7 @@
   {
    "cell_type": "code",
    "execution_count": 7,
-   "id": "7f511e3f",
    "metadata": {},
    "outputs": [
     {
@@ -19423,7 +19423,7 @@
   },
   {
    "cell_type": "markdown",
-   "id": "205a6e23",
    "metadata": {},
    "source": [
     "### Build Character"
@@ -19432,7 +19432,7 @@
   {
    "cell_type": "code",
    "execution_count": 8,
-   "id": "48a97fac",
    "metadata": {},
    "outputs": [
     {
@@ -19480,7 +19480,7 @@
   {
    "cell_type": "code",
    "execution_count": 9,
-   "id": "9b4ac5f7",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19491,7 +19491,7 @@
   {
    "cell_type": "code",
    "execution_count": 10,
-   "id": "a9a07875",
    "metadata": {},
    "outputs": [
     {
@@ -19509,7 +19509,7 @@
   {
    "cell_type": "code",
    "execution_count": 11,
-   "id": "8a3d39d8",
    "metadata": {},
    "outputs": [
     {
@@ -19536,7 +19536,7 @@
   {
    "cell_type": "code",
    "execution_count": 12,
-   "id": "934a4070",
    "metadata": {},
    "outputs": [
     {
@@ -19554,7 +19554,7 @@
   {
    "cell_type": "code",
    "execution_count": 13,
-   "id": "7f42a2b4",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19565,7 +19565,7 @@
   },
   {
    "cell_type": "markdown",
-   "id": "9a504bc4",
    "metadata": {},
    "source": [
     "# Tokenizer"
@@ -19574,7 +19574,7 @@
   {
    "cell_type": "code",
    "execution_count": 14,
-   "id": "0cec90b4",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19585,8 +19585,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 62,
-   "id": "dc9e79da",
    "metadata": {},
    "outputs": [
     {
@@ -19598,25 +19598,9 @@
       "loading file ./tokenizer_config.json\n",
       "loading file ./added_tokens.json\n",
       "loading file ./special_tokens_map.json\n",
-      "loading file None\n"
-     ]
-    },
-    {
-     "ename": "JSONDecodeError",
-     "evalue": "Expecting value: line 1 column 1 (char 0)",
-     "output_type": "error",
-     "traceback": [
-      "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
-      "\u001b[0;31mJSONDecodeError\u001b[0m                           Traceback (most recent call last)",
-      "Input \u001b[0;32mIn [62]\u001b[0m, in \u001b[0;36m<module>\u001b[0;34m\u001b[0m\n\u001b[0;32m----> 1\u001b[0m tokenizer \u001b[38;5;241m=\u001b[39m \u001b[43mWav2Vec2CTCTokenizer\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mfrom_pretrained\u001b[49m\u001b[43m(\u001b[49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[38;5;124;43m./\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43munk_token\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[38;5;124;43m[UNK]\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mpad_token\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[38;5;124;43m[PAD]\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mword_delimiter_token\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[38;5;124;43m|\u001b[39;49m\u001b[38;5;124;43m\"\u001b[39;49m\u001b[43m)\u001b[49m  \u001b[38;5;66;03m# './' load vocab.json in the current directory\u001b[39;00m\n\u001b[1;32m      2\u001b[0m feature_extractor \u001b[38;5;241m=\u001b[39m Wav2Vec2FeatureExtractor(feature_size\u001b[38;5;241m=\u001b[39m\u001b[38;5;241m1\u001b[39m, sampling_rate\u001b[38;5;241m=\u001b[39m\u001b[38;5;241m16000\u001b[39m, padding_value\u001b[38;5;241m=\u001b[39m\u001b[38;5;241m0.0\u001b[39m, do_normalize\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mTrue\u001b[39;00m, return_attention_mask\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mTrue\u001b[39;00m)  \n\u001b[1;32m      3\u001b[0m processor \u001b[38;5;241m=\u001b[39m Wav2Vec2Processor(feature_extractor\u001b[38;5;241m=\u001b[39mfeature_extractor, tokenizer\u001b[38;5;241m=\u001b[39mtokenizer)\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/tokenization_utils_base.py:1773\u001b[0m, in \u001b[0;36mPreTrainedTokenizerBase.from_pretrained\u001b[0;34m(cls, pretrained_model_name_or_path, *init_inputs, **kwargs)\u001b[0m\n\u001b[1;32m   1770\u001b[0m     \u001b[38;5;28;01melse\u001b[39;00m:\n\u001b[1;32m   1771\u001b[0m         logger\u001b[38;5;241m.\u001b[39minfo(\u001b[38;5;124mf\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mloading file \u001b[39m\u001b[38;5;132;01m{\u001b[39;00mfile_path\u001b[38;5;132;01m}\u001b[39;00m\u001b[38;5;124m from cache at \u001b[39m\u001b[38;5;132;01m{\u001b[39;00mresolved_vocab_files[file_id]\u001b[38;5;132;01m}\u001b[39;00m\u001b[38;5;124m\"\u001b[39m)\n\u001b[0;32m-> 1773\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[38;5;28;43mcls\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43m_from_pretrained\u001b[49m\u001b[43m(\u001b[49m\n\u001b[1;32m   1774\u001b[0m \u001b[43m    \u001b[49m\u001b[43mresolved_vocab_files\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1775\u001b[0m \u001b[43m    \u001b[49m\u001b[43mpretrained_model_name_or_path\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1776\u001b[0m \u001b[43m    \u001b[49m\u001b[43minit_configuration\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1777\u001b[0m \u001b[43m    \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43minit_inputs\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1778\u001b[0m \u001b[43m    \u001b[49m\u001b[43muse_auth_token\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43muse_auth_token\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1779\u001b[0m \u001b[43m    \u001b[49m\u001b[43mcache_dir\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mcache_dir\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1780\u001b[0m \u001b[43m    \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m   1781\u001b[0m \u001b[43m\u001b[49m\u001b[43m)\u001b[49m\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/tokenization_utils_base.py:1908\u001b[0m, in \u001b[0;36mPreTrainedTokenizerBase._from_pretrained\u001b[0;34m(cls, resolved_vocab_files, pretrained_model_name_or_path, init_configuration, use_auth_token, cache_dir, *init_inputs, **kwargs)\u001b[0m\n\u001b[1;32m   1906\u001b[0m \u001b[38;5;66;03m# Instantiate tokenizer.\u001b[39;00m\n\u001b[1;32m   1907\u001b[0m \u001b[38;5;28;01mtry\u001b[39;00m:\n\u001b[0;32m-> 1908\u001b[0m     tokenizer \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mcls\u001b[39;49m\u001b[43m(\u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43minit_inputs\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43minit_kwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m   1909\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mOSError\u001b[39;00m:\n\u001b[1;32m   1910\u001b[0m     \u001b[38;5;28;01mraise\u001b[39;00m \u001b[38;5;167;01mOSError\u001b[39;00m(\n\u001b[1;32m   1911\u001b[0m         \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mUnable to load vocabulary from file. \u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m   1912\u001b[0m         \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mPlease check that the provided vocabulary is accessible and not corrupted.\u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[1;32m   1913\u001b[0m     )\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/site-packages/transformers/models/wav2vec2/tokenization_wav2vec2.py:142\u001b[0m, in \u001b[0;36mWav2Vec2CTCTokenizer.__init__\u001b[0;34m(self, vocab_file, bos_token, eos_token, unk_token, pad_token, word_delimiter_token, do_lower_case, **kwargs)\u001b[0m\n\u001b[1;32m    139\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mdo_lower_case \u001b[38;5;241m=\u001b[39m do_lower_case\n\u001b[1;32m    141\u001b[0m \u001b[38;5;28;01mwith\u001b[39;00m \u001b[38;5;28mopen\u001b[39m(vocab_file, encoding\u001b[38;5;241m=\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mutf-8\u001b[39m\u001b[38;5;124m\"\u001b[39m) \u001b[38;5;28;01mas\u001b[39;00m vocab_handle:\n\u001b[0;32m--> 142\u001b[0m     \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mencoder \u001b[38;5;241m=\u001b[39m \u001b[43mjson\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mload\u001b[49m\u001b[43m(\u001b[49m\u001b[43mvocab_handle\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    143\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mdecoder \u001b[38;5;241m=\u001b[39m {v: k \u001b[38;5;28;01mfor\u001b[39;00m k, v \u001b[38;5;129;01min\u001b[39;00m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mencoder\u001b[38;5;241m.\u001b[39mitems()}\n\u001b[1;32m    145\u001b[0m \u001b[38;5;66;03m# make sure that tokens made of several\u001b[39;00m\n\u001b[1;32m    146\u001b[0m \u001b[38;5;66;03m# characters are not split at tokenization\u001b[39;00m\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/__init__.py:293\u001b[0m, in \u001b[0;36mload\u001b[0;34m(fp, cls, object_hook, parse_float, parse_int, parse_constant, object_pairs_hook, **kw)\u001b[0m\n\u001b[1;32m    274\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21mload\u001b[39m(fp, \u001b[38;5;241m*\u001b[39m, \u001b[38;5;28mcls\u001b[39m\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, object_hook\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, parse_float\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m,\n\u001b[1;32m    275\u001b[0m         parse_int\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, parse_constant\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, object_pairs_hook\u001b[38;5;241m=\u001b[39m\u001b[38;5;28;01mNone\u001b[39;00m, \u001b[38;5;241m*\u001b[39m\u001b[38;5;241m*\u001b[39mkw):\n\u001b[1;32m    276\u001b[0m     \u001b[38;5;124;03m\"\"\"Deserialize ``fp`` (a ``.read()``-supporting file-like object containing\u001b[39;00m\n\u001b[1;32m    277\u001b[0m \u001b[38;5;124;03m    a JSON document) to a Python object.\u001b[39;00m\n\u001b[1;32m    278\u001b[0m \n\u001b[0;32m   (...)\u001b[0m\n\u001b[1;32m    291\u001b[0m \u001b[38;5;124;03m    kwarg; otherwise ``JSONDecoder`` is used.\u001b[39;00m\n\u001b[1;32m    292\u001b[0m \u001b[38;5;124;03m    \"\"\"\u001b[39;00m\n\u001b[0;32m--> 293\u001b[0m     \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43mloads\u001b[49m\u001b[43m(\u001b[49m\u001b[43mfp\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mread\u001b[49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m    294\u001b[0m \u001b[43m        \u001b[49m\u001b[38;5;28;43mcls\u001b[39;49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;28;43mcls\u001b[39;49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mobject_hook\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mobject_hook\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m    295\u001b[0m \u001b[43m        \u001b[49m\u001b[43mparse_float\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mparse_float\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mparse_int\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mparse_int\u001b[49m\u001b[43m,\u001b[49m\n\u001b[1;32m    296\u001b[0m \u001b[43m        \u001b[49m\u001b[43mparse_constant\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mparse_constant\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43mobject_pairs_hook\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43mobject_pairs_hook\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkw\u001b[49m\u001b[43m)\u001b[49m\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/__init__.py:357\u001b[0m, in \u001b[0;36mloads\u001b[0;34m(s, cls, object_hook, parse_float, parse_int, parse_constant, object_pairs_hook, **kw)\u001b[0m\n\u001b[1;32m    352\u001b[0m     \u001b[38;5;28;01mdel\u001b[39;00m kw[\u001b[38;5;124m'\u001b[39m\u001b[38;5;124mencoding\u001b[39m\u001b[38;5;124m'\u001b[39m]\n\u001b[1;32m    354\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m (\u001b[38;5;28mcls\u001b[39m \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m object_hook \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m\n\u001b[1;32m    355\u001b[0m         parse_int \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m parse_float \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m\n\u001b[1;32m    356\u001b[0m         parse_constant \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m object_pairs_hook \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m \u001b[38;5;129;01mand\u001b[39;00m \u001b[38;5;129;01mnot\u001b[39;00m kw):\n\u001b[0;32m--> 357\u001b[0m     \u001b[38;5;28;01mreturn\u001b[39;00m \u001b[43m_default_decoder\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mdecode\u001b[49m\u001b[43m(\u001b[49m\u001b[43ms\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    358\u001b[0m \u001b[38;5;28;01mif\u001b[39;00m \u001b[38;5;28mcls\u001b[39m \u001b[38;5;129;01mis\u001b[39;00m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[1;32m    359\u001b[0m     \u001b[38;5;28mcls\u001b[39m \u001b[38;5;241m=\u001b[39m JSONDecoder\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/decoder.py:337\u001b[0m, in \u001b[0;36mJSONDecoder.decode\u001b[0;34m(self, s, _w)\u001b[0m\n\u001b[1;32m    332\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21mdecode\u001b[39m(\u001b[38;5;28mself\u001b[39m, s, _w\u001b[38;5;241m=\u001b[39mWHITESPACE\u001b[38;5;241m.\u001b[39mmatch):\n\u001b[1;32m    333\u001b[0m     \u001b[38;5;124;03m\"\"\"Return the Python representation of ``s`` (a ``str`` instance\u001b[39;00m\n\u001b[1;32m    334\u001b[0m \u001b[38;5;124;03m    containing a JSON document).\u001b[39;00m\n\u001b[1;32m    335\u001b[0m \n\u001b[1;32m    336\u001b[0m \u001b[38;5;124;03m    \"\"\"\u001b[39;00m\n\u001b[0;32m--> 337\u001b[0m     obj, end \u001b[38;5;241m=\u001b[39m \u001b[38;5;28;43mself\u001b[39;49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mraw_decode\u001b[49m\u001b[43m(\u001b[49m\u001b[43ms\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[43midx\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[43m_w\u001b[49m\u001b[43m(\u001b[49m\u001b[43ms\u001b[49m\u001b[43m,\u001b[49m\u001b[43m \u001b[49m\u001b[38;5;241;43m0\u001b[39;49m\u001b[43m)\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[43mend\u001b[49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m    338\u001b[0m     end \u001b[38;5;241m=\u001b[39m _w(s, end)\u001b[38;5;241m.\u001b[39mend()\n\u001b[1;32m    339\u001b[0m     \u001b[38;5;28;01mif\u001b[39;00m end \u001b[38;5;241m!=\u001b[39m \u001b[38;5;28mlen\u001b[39m(s):\n",
-      "File \u001b[0;32m/opt/conda/lib/python3.8/json/decoder.py:355\u001b[0m, in \u001b[0;36mJSONDecoder.raw_decode\u001b[0;34m(self, s, idx)\u001b[0m\n\u001b[1;32m    353\u001b[0m     obj, end \u001b[38;5;241m=\u001b[39m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39mscan_once(s, idx)\n\u001b[1;32m    354\u001b[0m \u001b[38;5;28;01mexcept\u001b[39;00m \u001b[38;5;167;01mStopIteration\u001b[39;00m \u001b[38;5;28;01mas\u001b[39;00m err:\n\u001b[0;32m--> 355\u001b[0m     \u001b[38;5;28;01mraise\u001b[39;00m JSONDecodeError(\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mExpecting value\u001b[39m\u001b[38;5;124m\"\u001b[39m, s, err\u001b[38;5;241m.\u001b[39mvalue) \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;28mNone\u001b[39m\n\u001b[1;32m    356\u001b[0m \u001b[38;5;28;01mreturn\u001b[39;00m obj, end\n",
-      "\u001b[0;31mJSONDecodeError\u001b[0m: Expecting value: line 1 column 1 (char 0)"
      ]
     }
    ],
@@ -19629,7 +19613,7 @@
   {
    "cell_type": "code",
    "execution_count": 26,
-   "id": "61738038",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19646,7 +19630,7 @@
   {
    "cell_type": "code",
    "execution_count": 27,
-   "id": "4b72b9b8",
    "metadata": {},
    "outputs": [
     {
@@ -19686,7 +19670,7 @@
   {
    "cell_type": "code",
    "execution_count": 17,
-   "id": "ccb4a36c",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19697,7 +19681,7 @@
   {
    "cell_type": "code",
    "execution_count": 18,
-   "id": "cf9d1391",
    "metadata": {},
    "outputs": [
     {
@@ -19722,7 +19706,7 @@
   {
    "cell_type": "code",
    "execution_count": 19,
-   "id": "57ea4c6f",
    "metadata": {},
    "outputs": [
     {
@@ -19769,7 +19753,7 @@
   {
    "cell_type": "code",
    "execution_count": 20,
-   "id": "7cb9fd2a",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19791,7 +19775,7 @@
   {
    "cell_type": "code",
    "execution_count": 22,
-   "id": "42f2952f",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19802,7 +19786,7 @@
   {
    "cell_type": "code",
    "execution_count": 41,
-   "id": "fe093630",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19814,7 +19798,7 @@
   {
    "cell_type": "code",
    "execution_count": 25,
-   "id": "a6efe782",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19874,7 +19858,7 @@
   {
    "cell_type": "code",
    "execution_count": 26,
-   "id": "e82a3663",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19884,7 +19868,7 @@
   {
    "cell_type": "code",
    "execution_count": 27,
-   "id": "1df03ab8",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19894,8 +19878,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 44,
-   "id": "8304f047",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -19908,9 +19892,9 @@
     "    pred_str = tokenizer.batch_decode(pred_ids)\n",
     "    label_str = tokenizer.batch_decode(pred.label_ids, group_tokens=False)\n",
     "\n",
-    "    print(\"pred : \", pred_ids[0])\n",
-    "    print(\"label: \", pred.label_ids[0])\n",
-    "    print(\"-----------------\")\n",
     "    \n",
     "    wer = wer_metric.compute(predictions=pred_str, references=label_str)\n",
     "\n",
@@ -19919,8 +19903,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 45,
-   "id": "f92c9b4d",
    "metadata": {
     "collapsed": true,
     "jupyter": {
@@ -19932,15 +19916,16 @@
      "name": "stderr",
      "output_type": "stream",
      "text": [
-      "loading configuration file https://huggingface.co/facebook/wav2vec2-xls-r-300m/resolve/main/config.json from cache at /workspace/.cache/huggingface/transformers/dabc27df63e37bd2a7a221c7774e35f36a280fbdf917cf54cadfc7df8c786f6f.a3e4c3c967d9985881e0ae550a5f6f668f897db5ab2e0802f9b97973b15970e6\n",
       "Model config Wav2Vec2Config {\n",
       "  \"activation_dropout\": 0.0,\n",
       "  \"adapter_kernel_size\": 3,\n",
       "  \"adapter_stride\": 2,\n",
       "  \"add_adapter\": false,\n",
       "  \"apply_spec_augment\": true,\n",
       "  \"architectures\": [\n",
-      "    \"Wav2Vec2ForPreTraining\"\n",
       "  ],\n",
       "  \"attention_dropout\": 0.1,\n",
       "  \"bos_token_id\": 1,\n",
@@ -20041,12 +20026,11 @@
       "  \"xvector_output_dim\": 512\n",
       "}\n",
       "\n",
-      "loading weights file https://huggingface.co/facebook/wav2vec2-xls-r-300m/resolve/main/pytorch_model.bin from cache at /workspace/.cache/huggingface/transformers/1e6a6507f3b689035cd4b247e2a37c154e27f39143f31357a49b4e38baeccc36.1edb32803799e27ed554eb7dd935f6745b1a0b17b0ea256442fe24db6eb546cd\n",
-      "Some weights of the model checkpoint at facebook/wav2vec2-xls-r-300m were not used when initializing Wav2Vec2ForCTC: ['project_q.bias', 'project_hid.weight', 'project_hid.bias', 'quantizer.weight_proj.bias', 'quantizer.weight_proj.weight', 'project_q.weight', 'quantizer.codevectors']\n",
-      "- This IS expected if you are initializing Wav2Vec2ForCTC from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).\n",
-      "- This IS NOT expected if you are initializing Wav2Vec2ForCTC from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).\n",
-      "Some weights of Wav2Vec2ForCTC were not initialized from the model checkpoint at facebook/wav2vec2-xls-r-300m and are newly initialized: ['lm_head.weight', 'lm_head.bias']\n",
-      "You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.\n"
      ]
     }
    ],
@@ -20054,7 +20038,8 @@
     "from transformers import Wav2Vec2ForCTC\n",
     "\n",
     "model = Wav2Vec2ForCTC.from_pretrained(\n",
-    "    \"facebook/wav2vec2-xls-r-300m\", \n",
     "    attention_dropout=0.1,\n",
     "    layerdrop=0.0,\n",
     "    feat_proj_dropout=0.0,\n",
@@ -20070,8 +20055,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 46,
-   "id": "7f2dd147",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -20080,8 +20065,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 47,
-   "id": "3d27466c",
    "metadata": {},
    "outputs": [
     {
@@ -20109,7 +20094,7 @@
     "  eval_steps=400,\n",
     "  logging_steps=100,\n",
     "  learning_rate=5e-5,\n",
-    "  warmup_steps=1000,\n",
     "  save_total_limit=3,\n",
     "  load_best_model_at_end=True\n",
     ")"
@@ -20117,8 +20102,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 48,
-   "id": "014ac4c9",
    "metadata": {},
    "outputs": [
     {
@@ -20145,14 +20130,9 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 49,
-   "id": "e6cb809a",
-   "metadata": {
-    "collapsed": true,
-    "jupyter": {
-     "outputs_hidden": true
-    }
-   },
    "outputs": [
     {
      "name": "stderr",
@@ -20177,7 +20157,7 @@
        "    <div>\n",
        "      \n",
        "      <progress value='4050' max='4050' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
-       "      [4050/4050 2:16:04, Epoch 49/50]\n",
        "    </div>\n",
        "    <table border=\"1\" class=\"dataframe\">\n",
        "  <thead>\n",
@@ -20191,63 +20171,63 @@
        "  <tbody>\n",
        "    <tr>\n",
        "      <td>400</td>\n",
-       "      <td>5.204900</td>\n",
-       "      <td>4.556981</td>\n",
-       "      <td>1.000000</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>800</td>\n",
-       "      <td>3.569000</td>\n",
-       "      <td>3.541533</td>\n",
-       "      <td>1.000000</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>1200</td>\n",
-       "      <td>3.483000</td>\n",
-       "      <td>3.395552</td>\n",
-       "      <td>1.000000</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>1600</td>\n",
-       "      <td>2.190600</td>\n",
-       "      <td>1.173165</td>\n",
-       "      <td>0.789678</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>2000</td>\n",
-       "      <td>1.796800</td>\n",
-       "      <td>0.763436</td>\n",
-       "      <td>0.667831</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>2400</td>\n",
-       "      <td>1.615000</td>\n",
-       "      <td>0.618224</td>\n",
-       "      <td>0.592161</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>2800</td>\n",
-       "      <td>1.520000</td>\n",
-       "      <td>0.547277</td>\n",
-       "      <td>0.547924</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>3200</td>\n",
-       "      <td>1.469600</td>\n",
-       "      <td>0.500246</td>\n",
-       "      <td>0.513000</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>3600</td>\n",
-       "      <td>1.417500</td>\n",
-       "      <td>0.475214</td>\n",
-       "      <td>0.502134</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>4000</td>\n",
-       "      <td>1.394300</td>\n",
-       "      <td>0.463765</td>\n",
-       "      <td>0.494373</td>\n",
        "    </tr>\n",
        "  </tbody>\n",
        "</table><p>"
@@ -20266,196 +20246,35 @@
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-400\n",
       "Configuration saved in ./checkpoint-400/config.json\n",
       "Model weights saved in ./checkpoint-400/pytorch_model.bin\n",
       "Configuration saved in ./checkpoint-400/preprocessor_config.json\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-800\n",
       "Configuration saved in ./checkpoint-800/config.json\n",
       "Model weights saved in ./checkpoint-800/pytorch_model.bin\n",
       "Configuration saved in ./checkpoint-800/preprocessor_config.json\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [ 1 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-1200\n",
       "Configuration saved in ./checkpoint-1200/config.json\n",
       "Model weights saved in ./checkpoint-1200/pytorch_model.bin\n",
       "Configuration saved in ./checkpoint-1200/preprocessor_config.json\n",
-      "Deleting older checkpoint [checkpoint-500] due to args.save_total_limit\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [30 45 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 11 43 72 72 72 72  6 26  0 72 25 72 72 72 72\n",
-      " 72 72 72 18 49 72 72 72 72 72  0 72 20 58 72 72  0  0 72 72 72 23 54 72\n",
-      " 72 72  0 72 11 55 72 72 28  0  0 72 72 21 70 70 27 43 72 72 72 72 72 72\n",
-      "  0  0  0 33 72 72 72 72 26 26 72 72 11 48 72 72 72 21 21 64  0 72 72 30\n",
-      " 72 72 72 72 72 72 59 72 72 72 23 54 72 72 72 27 27 72 72 72 72 72  1 72\n",
-      " 72  0  0 72 72 72 72 72  3 70 27 27 50 72 72 72  5  0  0 72 30 30 44 72\n",
-      " 72  5  5 70 72 31 31 43 72 72 72 72 72 72 72 72 27 27 72 72 72 72 72 25\n",
-      " 72  0  0 72 26 72 72 72 72 72 72 72  1  0 72 72 18 58 72  0  0  0 33 72\n",
-      " 72 72 72 72 72 72 26 26  0  0 72 72 25 25 49 72 72 72 72 72 72 72 72 26\n",
-      "  0  0 72 20 58 72 72 72 72  0  0 21 25 72 70 70 70 72 11 72 72 72 72 72\n",
-      " 72 59 72 72 72 72 72 29 72 72 72 72 72 70 70 16  0  0 30 30 72 72 72 25\n",
-      " 70 70 72 27 48 72 72 72  5  5  0 33 72 72 20 70 70 70 70 11 55 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 16 16  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-1600\n",
       "Configuration saved in ./checkpoint-1600/config.json\n",
       "Model weights saved in ./checkpoint-1600/pytorch_model.bin\n",
@@ -20464,48 +20283,7 @@
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [30 63 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 45 72 72 72 72 72 11 43 72 72 72 72  6 72  0 72 25 72 72 72 72\n",
-      " 72 72 72 16 72 72 72 72 72 72  0 72 20 58 72 72  0  0 72 72 72 23 54 72\n",
-      " 72 27  0 72 11 55 72 72 28  0  0 72 72 21 70 72 27 51 72 72 72 72 72 72\n",
-      "  0  0  0 33 70 72 72 72 26  0  0 72 72 11 72 72 72 21 21 64  0 72 72 30\n",
-      " 30 72 72 72 72 72 59 72 72 72 23 54 72 72 72 27 27 72 72 72 72  1 72 72\n",
-      " 72 72 72 72 72 72 72 72  3 70 72 27 50 72 72 72  5  0  0 72 30 30 44 72\n",
-      " 72  5 70 70 70 31 31 43 72 72 72 72 72 72 72 72 27 27 44 72 72 72 72 25\n",
-      " 72  0  0 72 26 72 72 72 72 72 72  1  1  0 72 72 18 58 72  0  0  0 33 70\n",
-      " 70 72 72 72 72 72 26 72  0  0 72 72 72 25 50 72 72 72 72 72 72 72 26 26\n",
-      "  0  0 72 20 58 72 72 72 72  0  0 72 25 72 72 70 70 72 11 72 72 72 72 72\n",
-      " 72 59 72  0  0 72 72 29 29 16 72 72 72 70 16 16  0  0 72 30 72 72 72 25\n",
-      " 70 70 72 27 72 72 72 72  5  5  0 33 72 72 15 70 70 70 72 11 55 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 16 16  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-2000\n",
       "Configuration saved in ./checkpoint-2000/config.json\n",
       "Model weights saved in ./checkpoint-2000/pytorch_model.bin\n",
@@ -20514,48 +20292,7 @@
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [30 63 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 45 72 72 72 72 72 72 11 43 72 72 72 72  6  6  0 72 25 62 72 72 72\n",
-      " 72 72 72 16 49 72 72 72 72 72  0 72 20 58 72 72  0  0 72 72 23 54 72 72\n",
-      " 72 28  0 11 55 72 72 72 28  0  0 72 72 21 70 70 27 51 72 72 72 72 72 72\n",
-      "  0  0  0 33 70 72 72 72 26  0  0 72 11 72 72 72 72 21 21 64  0 72 72 30\n",
-      " 72 72 72 72 72 72 59 72 72 72 23 54 72 72 72 27 27 72 72 72 72  1 72 72\n",
-      "  0  0 72 72 72 72 72 72  3 70 72 27 52 72 72 72  5  0  0 72 30 30 44 72\n",
-      " 72  5 70 70 72 31 43 72 72 72 72 72 72 72 72 72 27 44 44 72 72 72 25 25\n",
-      " 72  0  0 72 26 26 72 72 72 72 72  1  1  0 72 18 58 72 72  0  0 72 33 70\n",
-      " 72 72 72 72 72 72 26 72  0  0 72 72 72 25 49 72 72 72 72 72 72 72 26 26\n",
-      "  0  0 72 20 58 72 72 72 72 72  0 72 25 72 70 70 72 11 48 72 72 72 72 72\n",
-      " 59 59 72  0  0 72 72 29 16 16 72 72 70 70 16 72  0  0 30 30 72 72 72 25\n",
-      " 70 70 72 27 72 72 72 72  5 72  0 33 72 72 15 70 70 72 11 55 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 16 16  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-2400\n",
       "Configuration saved in ./checkpoint-2400/config.json\n",
       "Model weights saved in ./checkpoint-2400/pytorch_model.bin\n",
@@ -20564,48 +20301,7 @@
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [30 63 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 11 54 72 72 72 72  6 72  0 72 25 62 49 72 72\n",
-      " 72 72 72 16 49 72 72 72 72 72  0 72 20 58 72 72  0  0 72 72 23 54 72 72\n",
-      " 72 28  0 72 11 55 72 72 28  0  0 72 72 21 70 70 27 51 72 72 72 72 72 72\n",
-      "  0  0  0 33 70 72 72 72 26  0  0 72 11 48 72 72 72 21 21 64  0 72 72 30\n",
-      " 72 72 72 72 72 25 59 72 72 72 23 54 72 72 72 27 27 72 72 72 72 72 72 72\n",
-      "  0  0 72 72 72 72 72 72  3 70 72 27 52 72 72 72  5  0  0 72 30 30 44 72\n",
-      " 72  5 70 70 72 31 43 72 72 72 72 72 72 72 72 72 27 44 72 72 72 72 25 25\n",
-      " 72  0  0 72 26 72 72 72 72 72 72  1  1  0 72 18 58 72 72  0  0  0 33 70\n",
-      " 72 72 72 72 72 72 26 72  0  0 72 72 72 25 50 72 72 72 72 72 72 72 26 26\n",
-      "  0  0 72 20 58 72 72 72 72 72  0 72 25 72 70 70 70 72 11 48 72 72 72 72\n",
-      " 72 59 72  0  0 72 72 29 16 72 72 72 70 70 16 72  0  0 30 30 72 72 72 25\n",
-      " 70 72 72 27 72 72 72 72  5 72  0 33 72 72 15 70 70 72 11 55 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 16 16  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-2800\n",
       "Configuration saved in ./checkpoint-2800/config.json\n",
       "Model weights saved in ./checkpoint-2800/pytorch_model.bin\n",
@@ -20614,48 +20310,7 @@
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [30 63 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 11 43 72 72 72 72  6 72  0 72 25 62 62 49 72\n",
-      " 72 72 72 16 49 72 72 72 72  0  0 72 20 58 72 72  0  0 72 72 23 54 18 72\n",
-      " 72 28  0 72 11 55 72 28 28  0  0 72 72 21 70 72 27 51 72 72 72 72 72 72\n",
-      "  0  0  0 33 70 72 72 72 26  0  0 72 11 48 72 72 72 21 64 64  0 72 72 30\n",
-      " 72 72 72 72 72 25 72 72 72 72 23 54 72 72 72 27 27 72 72 72 72  1 72 72\n",
-      "  0  0 72 72 72 72 72 72  3 70 27 52 72 72 72 72  5  0  0 72 30 30 44 72\n",
-      " 72  5 70 70 72 31 43 72 72 72 72 72 72 72 72 72 27 44 72 72 72 72 25 72\n",
-      " 72  0  0 72 26 26 72 72 72 72 72  1 72  0 72 18 58 72 72  0  0  0 33 70\n",
-      " 72 72 72 72 72 72 26 72  0  0 72 72 72 25 50 72 72 72 72 72 72 72 26 72\n",
-      "  0  0 72 20 58 72 72 72 72  0  0 72 25 72 72 70 70 72 11 48 72 72 72 72\n",
-      " 59 72 72  0  0 72 72 29 16 72 72 72 70 70 16 72  0  0 72 30 72 72 72 25\n",
-      " 70 72 27 27 72 72 72 72  5 72  0 33 72 72 15 70 70 72 11 55 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 16 16  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-3200\n",
       "Configuration saved in ./checkpoint-3200/config.json\n",
       "Model weights saved in ./checkpoint-3200/pytorch_model.bin\n",
@@ -20664,48 +20319,7 @@
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [30 63 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 11 44 72 72 72 72  6 72  0 72 25 62 49 49 72\n",
-      " 72 72 72 16 49 72 72 72 72 72  0 72 20 58 72 72  0  0 72 72 23 54 72 72\n",
-      " 72 28  0 72 11 55 72 72 28  0  0 72 72 21 70 70 27 51 72 72 72 72 72 72\n",
-      "  0  0  0 42 70 72 72 72 26  0  0 72 11 48 72 72 72 21 64 64  0 72 72 30\n",
-      " 72 72 72 72 72 25 72 72 72 72 23 54 72 72 72 27 27 72 72 72 72 72 72 72\n",
-      "  0  0 72 72 72 72 72 72  3 70 72 27 52 72 72 72  5  0  0 72 30 30 44 72\n",
-      " 72  5 70 70 72 31 43 72 72 72 72 72 72 72 72 72 27 44 72 72 72 72 25 72\n",
-      " 72  0  0 72 72 26 72 72 72 72 72  1 72  0 72 18 58 72 72  0  0  0 33 70\n",
-      " 72 72 72 72 72 72 26 72  0  0 72 72 72 25 50 72 72 72 72 72 72 72 26 26\n",
-      "  0  0 72 20 58 72 72 72 72 72  0 72 25 72 72 70 70 72 11 48 72 72 72 72\n",
-      " 72 59 72 72  0 72 72 29 16 72 72 72 72 70 16 72  0  0 72 30 72 72 72 25\n",
-      " 70 72 72 27 72 72 72 72  5 72  0 33 72 72 15 70 70 72 11 55 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 16 16  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-3600\n",
       "Configuration saved in ./checkpoint-3600/config.json\n",
       "Model weights saved in ./checkpoint-3600/pytorch_model.bin\n",
@@ -20714,48 +20328,7 @@
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
-      "  Batch size = 8\n"
-     ]
-    },
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "pred :  [30 63 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 11 43 72 72 72 72  6 72  0 72 25 62 62 49 72\n",
-      " 72 72 72 16 49 72 72 72 72 72  0 72 20 58 72 72  0  0 72 72 23 54 18 72\n",
-      " 72 28  0  0 11 55 72 72 28  0  0 72 72 21 70 70 27 51 72 72 72 72 72 72\n",
-      "  0  0  0 42 70 72 72 72 26  0  0 72 11 48 72 72 72 21 21 64  0 72 72 30\n",
-      " 30 72 72 72 72 25 72 72 72 72 23 54 72 72 72 27 27 72 72 72 72 72 72 72\n",
-      "  0  0 72 72 72 72 72 72  3 70 72 27 52 72 72 72  5  0  0 72 30 30 44 72\n",
-      " 72  5 70 70 72 31 43 72 72 72 72 72 72 72 72 72 27 46 72 72 72 72 25 72\n",
-      " 72  0  0 72 72 26 72 72 72 72 72  1 72  0 72 18 58 72 72  0  0  0 33 70\n",
-      " 72 72 72 72 72 72 26 72  0  0 72 72 72 25 50 72 72 72 72 72 72 72 26 26\n",
-      "  0  0 72 20 58 72 72 72 72 72  0 72 25 72 70 70 70 72 11 48 72 72 72 72\n",
-      " 72 59 72 72  0 72 72 29 16 72 72 72 70 70 16 72  0  0 72 30 72 72 72 25\n",
-      " 72 72 72 27 72 72 72 72  5 72  0 33 72 72 15 70 70 72 72 12 55 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 16 16  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\n",
-      "  0  0  0  0  0  0  0]\n",
-      "label:  [30 63 45  0 11 43  6 64  0 25 62 49 16 49  0 20 58  0 23 54 28  0 11 55\n",
-      " 28  0 21 70 27 51  0 42 70 26  0 13 48 21  0 30 25 70 24 43 27 61  0  3\n",
-      " 70 27 52  5  0 30  5 70 31 43 27 46 25  0 26  1  0 18 58  0 42 70 26  0\n",
-      " 25 62 49 26  0 20 58  0 25 70 11 48 59  0 29 16 70 11  0 30 59 27 57  5\n",
-      " 33 15 70 11 55 16 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72 72\n",
-      " 72 72 72 72 72 72 72]\n",
-      "-----------------\n"
-     ]
-    },
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
       "Saving model checkpoint to ./checkpoint-4000\n",
       "Configuration saved in ./checkpoint-4000/config.json\n",
       "Model weights saved in ./checkpoint-4000/pytorch_model.bin\n",
@@ -20766,16 +20339,16 @@
       "Training completed. Do not forget to share your model on huggingface.co/models =)\n",
       "\n",
       "\n",
-      "Loading best model from ./checkpoint-4000 (score: 0.46376487612724304).\n"
      ]
     },
     {
      "data": {
       "text/plain": [
-       "TrainOutput(global_step=4050, training_loss=2.89372775796019, metrics={'train_runtime': 8168.6927, 'train_samples_per_second': 16.006, 'train_steps_per_second': 0.496, 'total_flos': 1.9735608328149316e+19, 'train_loss': 2.89372775796019, 'epoch': 49.99})"
       ]
      },
-     "execution_count": 49,
      "metadata": {},
      "output_type": "execute_result"
     }
@@ -20787,7 +20360,7 @@
   {
    "cell_type": "code",
    "execution_count": 57,
-   "id": "57c2527b",
    "metadata": {},
    "outputs": [
     {
@@ -20806,8 +20379,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 53,
-   "id": "0211e267",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -20823,8 +20396,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 54,
-   "id": "62f6fd3e",
    "metadata": {},
    "outputs": [
     {
@@ -20842,8 +20415,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 60,
-   "id": "b050fb9f",
    "metadata": {},
    "outputs": [
     {
@@ -20858,12 +20431,12 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "331db7acce774ee3b699aa82a0451092",
        "version_major": 2,
        "version_minor": 0
       },
       "text/plain": [
-       "Download file pytorch_model.bin:   0%|          | 3.47k/1.18G [00:00<?, ?B/s]"
       ]
      },
      "metadata": {},
@@ -20872,7 +20445,35 @@
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
-       "model_id": "db90465291c64e9f82988698d2473234",
        "version_major": 2,
        "version_minor": 0
       },
@@ -20890,6 +20491,39 @@
       "Configuration saved in vitouphy/xls-r-300m-km/config.json\n",
       "Model weights saved in vitouphy/xls-r-300m-km/pytorch_model.bin\n"
      ]
     }
    ],
    "source": [
@@ -20898,8 +20532,8 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 61,
-   "id": "9d7cb173",
    "metadata": {},
    "outputs": [
     {
@@ -20920,7 +20554,7 @@
   {
    "cell_type": "code",
    "execution_count": null,
-   "id": "8dc01ad4",
    "metadata": {},
    "outputs": [],
    "source": []

   {
    "cell_type": "code",
    "execution_count": 1,
+   "id": "bff05704",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": null,
+   "id": "9637cdfd",
    "metadata": {
     "collapsed": true,
     "jupyter": {
   },
   {
    "cell_type": "markdown",
+   "id": "b11b1d53",
    "metadata": {},
    "source": [
     "### Load KH Data"
   {
    "cell_type": "code",
    "execution_count": 4,
+   "id": "f35b6d68",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 5,
+   "id": "a0b561cb",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 6,
+   "id": "c8ae4532",
    "metadata": {},
    "outputs": [],
    "source": [
   },
   {
    "cell_type": "markdown",
+   "id": "4649ca2b",
    "metadata": {},
    "source": [
     "### Clean Up the Text"
   {
    "cell_type": "code",
    "execution_count": 6,
+   "id": "363283a2",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 7,
+   "id": "51f70aa8",
    "metadata": {
     "collapsed": true,
     "jupyter": {
   {
    "cell_type": "code",
    "execution_count": 7,
+   "id": "fbc089d7",
    "metadata": {},
    "outputs": [
     {
   },
   {
    "cell_type": "markdown",
+   "id": "af02801f",
    "metadata": {},
    "source": [
     "### Build Character"
   {
    "cell_type": "code",
    "execution_count": 8,
+   "id": "a9e58b43",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 9,
+   "id": "4480543c",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 10,
+   "id": "99857f4d",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 11,
+   "id": "bec53215",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 12,
+   "id": "cf58f8a4",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 13,
+   "id": "0c621a15",
    "metadata": {},
    "outputs": [],
    "source": [
   },
   {
    "cell_type": "markdown",
+   "id": "bb8b5aa3",
    "metadata": {},
    "source": [
     "# Tokenizer"
   {
    "cell_type": "code",
    "execution_count": 14,
+   "id": "dc1c1984",
    "metadata": {},
    "outputs": [],
    "source": [
   },
   {
    "cell_type": "code",
+   "execution_count": 63,
+   "id": "6324377d",
    "metadata": {},
    "outputs": [
     {
       "loading file ./tokenizer_config.json\n",
       "loading file ./added_tokens.json\n",
       "loading file ./special_tokens_map.json\n",
+      "loading file None\n",
+      "Adding <s> to the vocabulary\n",
+      "Adding </s> to the vocabulary\n"
      ]
     }
    ],
   {
    "cell_type": "code",
    "execution_count": 26,
+   "id": "f971580d",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 27,
+   "id": "d0368c7a",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 17,
+   "id": "62e9d0c6",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 18,
+   "id": "f642a861",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 19,
+   "id": "0c756a07",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": 20,
+   "id": "d2a5374c",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 22,
+   "id": "9c3697ba",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 41,
+   "id": "d5bd0662",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 25,
+   "id": "639dd5a7",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 26,
+   "id": "c4fe1643",
    "metadata": {},
    "outputs": [],
    "source": [
   {
    "cell_type": "code",
    "execution_count": 27,
+   "id": "9fb388e3",
    "metadata": {},
    "outputs": [],
    "source": [
   },
   {
    "cell_type": "code",
+   "execution_count": 64,
+   "id": "96611455",
    "metadata": {},
    "outputs": [],
    "source": [
     "    pred_str = tokenizer.batch_decode(pred_ids)\n",
     "    label_str = tokenizer.batch_decode(pred.label_ids, group_tokens=False)\n",
     "\n",
+    "#     print(\"pred : \", pred_ids[0])\n",
+    "#     print(\"label: \", pred.label_ids[0])\n",
+    "#     print(\"-----------------\")\n",
     "    \n",
     "    wer = wer_metric.compute(predictions=pred_str, references=label_str)\n",
     "\n",
   },
   {
    "cell_type": "code",
+   "execution_count": 66,
+   "id": "bb429520",
    "metadata": {
     "collapsed": true,
     "jupyter": {
      "name": "stderr",
      "output_type": "stream",
      "text": [
+      "loading configuration file checkpoint-4000/config.json\n",
       "Model config Wav2Vec2Config {\n",
+      "  \"_name_or_path\": \"facebook/wav2vec2-xls-r-300m\",\n",
       "  \"activation_dropout\": 0.0,\n",
       "  \"adapter_kernel_size\": 3,\n",
       "  \"adapter_stride\": 2,\n",
       "  \"add_adapter\": false,\n",
       "  \"apply_spec_augment\": true,\n",
       "  \"architectures\": [\n",
+      "    \"Wav2Vec2ForCTC\"\n",
       "  ],\n",
       "  \"attention_dropout\": 0.1,\n",
       "  \"bos_token_id\": 1,\n",
       "  \"xvector_output_dim\": 512\n",
       "}\n",
       "\n",
+      "loading weights file checkpoint-4000/pytorch_model.bin\n",
+      "All model checkpoint weights were used when initializing Wav2Vec2ForCTC.\n",
+      "\n",
+      "All the weights of Wav2Vec2ForCTC were initialized from the model checkpoint at checkpoint-4000.\n",
+      "If your task is similar to the task the model of the checkpoint was trained on, you can already use Wav2Vec2ForCTC for predictions without further training.\n"
      ]
     }
    ],
     "from transformers import Wav2Vec2ForCTC\n",
     "\n",
     "model = Wav2Vec2ForCTC.from_pretrained(\n",
+    "#     \"facebook/wav2vec2-xls-r-300m\", \n",
+    "    \"checkpoint-4000\",\n",
     "    attention_dropout=0.1,\n",
     "    layerdrop=0.0,\n",
     "    feat_proj_dropout=0.0,\n",
   },
   {
    "cell_type": "code",
+   "execution_count": 68,
+   "id": "ffcd9012",
    "metadata": {},
    "outputs": [],
    "source": [
   },
   {
    "cell_type": "code",
+   "execution_count": 69,
+   "id": "b07418cf",
    "metadata": {},
    "outputs": [
     {
     "  eval_steps=400,\n",
     "  logging_steps=100,\n",
     "  learning_rate=5e-5,\n",
+    "  warmup_steps=100,\n",
     "  save_total_limit=3,\n",
     "  load_best_model_at_end=True\n",
     ")"
   },
   {
    "cell_type": "code",
+   "execution_count": 70,
+   "id": "7776cd7d",
    "metadata": {},
    "outputs": [
     {
   },
   {
    "cell_type": "code",
+   "execution_count": 71,
+   "id": "ac33ed4c",
+   "metadata": {},
    "outputs": [
     {
      "name": "stderr",
        "    <div>\n",
        "      \n",
        "      <progress value='4050' max='4050' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
+       "      [4050/4050 2:16:09, Epoch 49/50]\n",
        "    </div>\n",
        "    <table border=\"1\" class=\"dataframe\">\n",
        "  <thead>\n",
        "  <tbody>\n",
        "    <tr>\n",
        "      <td>400</td>\n",
+       "      <td>1.382900</td>\n",
+       "      <td>0.429020</td>\n",
+       "      <td>0.479627</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>800</td>\n",
+       "      <td>1.315600</td>\n",
+       "      <td>0.385632</td>\n",
+       "      <td>0.447419</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>1200</td>\n",
+       "      <td>1.239600</td>\n",
+       "      <td>0.359977</td>\n",
+       "      <td>0.430733</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>1600</td>\n",
+       "      <td>1.144400</td>\n",
+       "      <td>0.342276</td>\n",
+       "      <td>0.417928</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>2000</td>\n",
+       "      <td>1.097900</td>\n",
+       "      <td>0.337029</td>\n",
+       "      <td>0.388436</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>2400</td>\n",
+       "      <td>1.071400</td>\n",
+       "      <td>0.323725</td>\n",
+       "      <td>0.370974</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>2800</td>\n",
+       "      <td>1.044200</td>\n",
+       "      <td>0.333624</td>\n",
+       "      <td>0.368258</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>3200</td>\n",
+       "      <td>1.049200</td>\n",
+       "      <td>0.316629</td>\n",
+       "      <td>0.352736</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>3600</td>\n",
+       "      <td>1.028400</td>\n",
+       "      <td>0.317763</td>\n",
+       "      <td>0.356616</td>\n",
        "    </tr>\n",
        "    <tr>\n",
        "      <td>4000</td>\n",
+       "      <td>1.030200</td>\n",
+       "      <td>0.314151</td>\n",
+       "      <td>0.351184</td>\n",
        "    </tr>\n",
        "  </tbody>\n",
        "</table><p>"
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-400\n",
       "Configuration saved in ./checkpoint-400/config.json\n",
       "Model weights saved in ./checkpoint-400/pytorch_model.bin\n",
       "Configuration saved in ./checkpoint-400/preprocessor_config.json\n",
+      "Deleting older checkpoint [checkpoint-3200] due to args.save_total_limit\n",
+      "Deleting older checkpoint [checkpoint-3600] due to args.save_total_limit\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-800\n",
       "Configuration saved in ./checkpoint-800/config.json\n",
       "Model weights saved in ./checkpoint-800/pytorch_model.bin\n",
       "Configuration saved in ./checkpoint-800/preprocessor_config.json\n",
+      "Deleting older checkpoint [checkpoint-4000-prev-best] due to args.save_total_limit\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-1200\n",
       "Configuration saved in ./checkpoint-1200/config.json\n",
       "Model weights saved in ./checkpoint-1200/pytorch_model.bin\n",
       "Configuration saved in ./checkpoint-1200/preprocessor_config.json\n",
+      "Deleting older checkpoint [checkpoint-4000] due to args.save_total_limit\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-1600\n",
       "Configuration saved in ./checkpoint-1600/config.json\n",
       "Model weights saved in ./checkpoint-1600/pytorch_model.bin\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-2000\n",
       "Configuration saved in ./checkpoint-2000/config.json\n",
       "Model weights saved in ./checkpoint-2000/pytorch_model.bin\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-2400\n",
       "Configuration saved in ./checkpoint-2400/config.json\n",
       "Model weights saved in ./checkpoint-2400/pytorch_model.bin\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-2800\n",
       "Configuration saved in ./checkpoint-2800/config.json\n",
       "Model weights saved in ./checkpoint-2800/pytorch_model.bin\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-3200\n",
       "Configuration saved in ./checkpoint-3200/config.json\n",
       "Model weights saved in ./checkpoint-3200/pytorch_model.bin\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-3600\n",
       "Configuration saved in ./checkpoint-3600/config.json\n",
       "Model weights saved in ./checkpoint-3600/pytorch_model.bin\n",
       "The following columns in the evaluation set  don't have a corresponding argument in `Wav2Vec2ForCTC.forward` and have been ignored: input_length.\n",
       "***** Running Evaluation *****\n",
       "  Num examples = 291\n",
+      "  Batch size = 8\n",
       "Saving model checkpoint to ./checkpoint-4000\n",
       "Configuration saved in ./checkpoint-4000/config.json\n",
       "Model weights saved in ./checkpoint-4000/pytorch_model.bin\n",
       "Training completed. Do not forget to share your model on huggingface.co/models =)\n",
       "\n",
       "\n",
+      "Loading best model from ./checkpoint-4000 (score: 0.3141506016254425).\n"
      ]
     },
     {
      "data": {
       "text/plain": [
+       "TrainOutput(global_step=4050, training_loss=1.1567209813624253, metrics={'train_runtime': 8173.6251, 'train_samples_per_second': 15.997, 'train_steps_per_second': 0.495, 'total_flos': 1.9735608328149316e+19, 'train_loss': 1.1567209813624253, 'epoch': 49.99})"
       ]
      },
+     "execution_count": 71,
      "metadata": {},
      "output_type": "execute_result"
     }
   {
    "cell_type": "code",
    "execution_count": 57,
+   "id": "19b3350f",
    "metadata": {},
    "outputs": [
     {
   },
   {
    "cell_type": "code",
+   "execution_count": 72,
+   "id": "724e14ef",
    "metadata": {},
    "outputs": [],
    "source": [
   },
   {
    "cell_type": "code",
+   "execution_count": 73,
+   "id": "75b87f11",
    "metadata": {},
    "outputs": [
     {
   },
   {
    "cell_type": "code",
+   "execution_count": 74,
+   "id": "9e4a2ec9",
    "metadata": {},
    "outputs": [
     {
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
+       "model_id": "ae4aa0641113454c801089fa2dbd6777",
        "version_major": 2,
        "version_minor": 0
       },
       "text/plain": [
+       "Download file pytorch_model.bin:   0%|          | 2.83k/1.18G [00:00<?, ?B/s]"
       ]
      },
      "metadata": {},
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
+       "model_id": "9a3129d18855473ba7da0f290f26419b",
+       "version_major": 2,
+       "version_minor": 0
+      },
+      "text/plain": [
+       "Download file training_args.bin:  63%|######2   | 1.84k/2.92k [00:00<?, ?B/s]"
+      ]
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    },
+    {
+     "data": {
+      "application/vnd.jupyter.widget-view+json": {
+       "model_id": "dabccfa9f14045919cf70a905afb5506",
+       "version_major": 2,
+       "version_minor": 0
+      },
+      "text/plain": [
+       "Clean file training_args.bin:  34%|###4      | 1.00k/2.92k [00:00<?, ?B/s]"
+      ]
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    },
+    {
+     "data": {
+      "application/vnd.jupyter.widget-view+json": {
+       "model_id": "ee7e633b1e784625b2d3695176f6c0f2",
        "version_major": 2,
        "version_minor": 0
       },
       "Configuration saved in vitouphy/xls-r-300m-km/config.json\n",
       "Model weights saved in vitouphy/xls-r-300m-km/pytorch_model.bin\n"
      ]
+    },
+    {
+     "data": {
+      "application/vnd.jupyter.widget-view+json": {
+       "model_id": "9738e4743ca3470f863dfd4d85f6e411",
+       "version_major": 2,
+       "version_minor": 0
+      },
+      "text/plain": [
+       "Upload file pytorch_model.bin:   0%|          | 3.39k/1.18G [00:00<?, ?B/s]"
+      ]
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "To https://huggingface.co/vitouphy/xls-r-300m-km\n",
+      "   6f203d5..74be6ec  main -> main\n",
+      "\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "'https://huggingface.co/vitouphy/xls-r-300m-km/commit/74be6ece8cca85ef00972b1f3f88460217d0acf5'"
+      ]
+     },
+     "execution_count": 74,
+     "metadata": {},
+     "output_type": "execute_result"
     }
    ],
    "source": [
   },
   {
    "cell_type": "code",
+   "execution_count": 75,
+   "id": "8c70b0b9",
    "metadata": {},
    "outputs": [
     {
   {
    "cell_type": "code",
    "execution_count": null,
+   "id": "96cd8308",
    "metadata": {},
    "outputs": [],
    "source": []

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c93624168d989aabdf33bd32b0dbfa8b32857515b2a2f04190df98bd15ae4e61
 size 2991

 version https://git-lfs.github.com/spec/v1
+oid sha256:59beaed2af6d1171b371d53a9d0077d3ab22cd9f2392cf839539bb2e9f36d978
 size 2991