Bajiyo
/

w2v-bert-2.0-malayalam_mixeddataset_thre

@@ -1,10 +1,10 @@
 ---
 license: mit
 tags:
 - generated_from_trainer
 metrics:
 - wer
-base_model: facebook/w2v-bert-2.0
 model-index:
 - name: w2v-bert-2.0-malayalam_mixeddataset_thre
   results: []
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/w2v-bert-2.0](https://huggingface.co/facebook/w2v-bert-2.0) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1707
-- Wer: 0.1157
 ## Model description
@@ -53,27 +53,27 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Wer    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|
-| 1.0465        | 0.47  | 600   | 0.3707          | 0.4561 |
-| 0.167         | 0.95  | 1200  | 0.2639          | 0.3710 |
-| 0.1209        | 1.42  | 1800  | 0.2276          | 0.3143 |
-| 0.1028        | 1.9   | 2400  | 0.2139          | 0.2657 |
-| 0.0831        | 2.37  | 3000  | 0.2015          | 0.2553 |
-| 0.0746        | 2.85  | 3600  | 0.1729          | 0.2374 |
-| 0.0626        | 3.32  | 4200  | 0.1501          | 0.2192 |
-| 0.0547        | 3.8   | 4800  | 0.1824          | 0.2217 |
-| 0.0448        | 4.27  | 5400  | 0.1571          | 0.1747 |
-| 0.0383        | 4.74  | 6000  | 0.1438          | 0.1662 |
-| 0.0353        | 5.22  | 6600  | 0.1476          | 0.1657 |
-| 0.029         | 5.69  | 7200  | 0.1543          | 0.1655 |
-| 0.025         | 6.17  | 7800  | 0.1620          | 0.1553 |
-| 0.02          | 6.64  | 8400  | 0.1481          | 0.1468 |
-| 0.0167        | 7.12  | 9000  | 0.1501          | 0.1376 |
-| 0.0126        | 7.59  | 9600  | 0.1469          | 0.1341 |
-| 0.0122        | 8.07  | 10200 | 0.1565          | 0.1291 |
-| 0.0079        | 8.54  | 10800 | 0.1592          | 0.1242 |
-| 0.0073        | 9.02  | 11400 | 0.1594          | 0.1237 |
-| 0.0039        | 9.49  | 12000 | 0.1710          | 0.1204 |
-| 0.0039        | 9.96  | 12600 | 0.1707          | 0.1157 |
 ### Framework versions

 ---
 license: mit
+base_model: facebook/w2v-bert-2.0
 tags:
 - generated_from_trainer
 metrics:
 - wer
 model-index:
 - name: w2v-bert-2.0-malayalam_mixeddataset_thre
   results: []
 This model is a fine-tuned version of [facebook/w2v-bert-2.0](https://huggingface.co/facebook/w2v-bert-2.0) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1604
+- Wer: 0.1244
 ## Model description
 | Training Loss | Epoch | Step  | Validation Loss | Wer    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|
+| 1.1974        | 0.47  | 600   | 0.3732          | 0.4971 |
+| 0.1677        | 0.95  | 1200  | 0.2552          | 0.3411 |
+| 0.1229        | 1.42  | 1800  | 0.2184          | 0.3123 |
+| 0.1041        | 1.9   | 2400  | 0.2044          | 0.2921 |
+| 0.0825        | 2.37  | 3000  | 0.2150          | 0.2667 |
+| 0.0756        | 2.85  | 3600  | 0.1882          | 0.2361 |
+| 0.0627        | 3.32  | 4200  | 0.1735          | 0.2493 |
+| 0.0557        | 3.8   | 4800  | 0.1653          | 0.2117 |
+| 0.0454        | 4.27  | 5400  | 0.1669          | 0.1891 |
+| 0.0394        | 4.74  | 6000  | 0.1610          | 0.1903 |
+| 0.0363        | 5.22  | 6600  | 0.1654          | 0.1699 |
+| 0.0278        | 5.69  | 7200  | 0.1465          | 0.1640 |
+| 0.025         | 6.17  | 7800  | 0.1503          | 0.1617 |
+| 0.0198        | 6.64  | 8400  | 0.1429          | 0.1466 |
+| 0.0174        | 7.12  | 9000  | 0.1440          | 0.1453 |
+| 0.013         | 7.59  | 9600  | 0.1496          | 0.1433 |
+| 0.0125        | 8.07  | 10200 | 0.1465          | 0.1274 |
+| 0.0076        | 8.54  | 10800 | 0.1479          | 0.1349 |
+| 0.0076        | 9.02  | 11400 | 0.1521          | 0.1229 |
+| 0.0041        | 9.49  | 12000 | 0.1600          | 0.1291 |
+| 0.0038        | 9.96  | 12600 | 0.1604          | 0.1244 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:16379c4f7d9a42d73227599d64ad63468a7f1ae92defcc21ebfc4a2b33c191c2
 size 2423130260

 version https://git-lfs.github.com/spec/v1
+oid sha256:8bd4c177d910057368a78b0bea9e01853d393f72f85316e3fb28a752a8ac22e2
 size 2423130260

runs/May06_10-12-05_kudsit-dgxserver/events.out.tfevents.1714971283.kudsit-dgxserver.3772578.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eea3532c4e7fd4219f8fd403e32092f8cf8e481a20868613909aae09563f6fe6
-size 17041

 version https://git-lfs.github.com/spec/v1
+oid sha256:cf1d312ee2cbddf1f62d8a34e8f9c0a36340afaec9fae2af2c59522c6d881d39
+size 17395