frisibeli/legal-roberta-large

Text Classification

Generated from Trainer

Inference Endpoints

frisibeli committed on Jan 4

Commit 47af526 • 1 Parent(s): 5425b48

frisibeli/roberta-lexml

Files changed (3)

README.md +9 -9
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [lexlms/legal-roberta-large](https://huggingface.co/lexlms/legal-roberta-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6295
-- F1: 0.4711
+- Loss: 1.0297
+- F1: 0.4489
 ## Model description
@@ -37,7 +37,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3e-05
+- learning_rate: 2e-06
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 100
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 80
+- lr_scheduler_warmup_steps: 30
 - num_epochs: 5
 - mixed_precision_training: Native AMP
@@ -53,11 +53,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.6641        | 0.96  | 18   | 0.5623          | 0.4322 |
-| 0.5628        | 1.97  | 37   | 0.5583          | 0.4322 |
-| 0.554         | 2.99  | 56   | 0.6142          | 0.4322 |
-| 0.5071        | 4.0   | 75   | 0.5391          | 0.4866 |
-| 0.3651        | 4.8   | 90   | 0.6295          | 0.4711 |
+| 0.2471        | 0.96  | 18   | 0.8701          | 0.4711 |
+| 0.1889        | 1.97  | 37   | 0.9103          | 0.4562 |
+| 0.1444        | 2.99  | 56   | 0.9706          | 0.4489 |
+| 0.1283        | 4.0   | 75   | 1.0204          | 0.4562 |
+| 0.1411        | 4.8   | 90   | 1.0297          | 0.4489 |
 ### Framework versions
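
The updated hyperparameters combine `lr_scheduler_type: linear` with 30 warmup steps and a peak learning rate of 2e-06, and the results table ends at step 90. The sketch below shows roughly how such a schedule would evolve. It assumes the common shape of linear warmup followed by linear decay to zero, and it assumes that step 90 is the final optimizer step, which the diff does not state explicitly. The function name `linear_lr` is hypothetical and is not taken from the training code.

```python
def linear_lr(step: int,
              base_lr: float = 2e-06,
              warmup_steps: int = 30,
              total_steps: int = 90) -> float:
    """Learning rate at a given optimizer step (illustrative sketch only).

    Assumed shape: ramp linearly from 0 to base_lr over warmup_steps,
    then decay linearly back to 0 at total_steps.
    """
    if step < warmup_steps:
        # Warmup phase: fraction of the peak rate reached so far.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: fraction of the post-warmup steps still remaining.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the rate at the evaluation steps listed in the results table.
    for step in (18, 37, 56, 75, 90):
        print(step, linear_lr(step))
```

Under these assumptions the rate is still ramping up at the first evaluation (step 18) and has decayed to zero by step 90. Passing `base_lr=3e-05, warmup_steps=80` gives the corresponding curve for the previous configuration, where warmup would cover most of the run.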

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4e14328cd319a991f9c5418c194a1c390234f8d841155bd66c5f7626d2cc2590
+oid sha256:d78a9c9cefb82bc7056a16ffe21c241addac96bda925442989884284b23a7b19
 size 1420414072

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6511186045cd91933c15c7d2628e7e355ffd97fb387aee7918cd7506fa6a7be7
+oid sha256:f36765709db3a4b779ec9256d853fad8bfbed6cbe6f9c342265002d69880570b
 size 4600
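
The `model.safetensors` and `training_args.bin` entries above are Git LFS pointer files: each records a spec version, a `sha256` object ID, and a byte size, while the binary itself is stored separately. As a rough sketch, a downloaded file could be checked against such a pointer as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is assumed to be passed without the leading diff markers (`+`, `-`, or space).

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.strip().split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)  # e.g. "sha256:<hex digest>"
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False  # cheap check first, avoids hashing a mismatched file
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:f36765709db3a4b779ec9256d853fad8bfbed6cbe6f9c342265002d69880570b\n"
        "size 4600\n"
    )
    print(pointer["algo"], pointer["size"])
```

Note that in this commit both files keep the same size (1420414072 and 4600 bytes) while the object IDs change, so a size comparison alone would not distinguish the old and new versions; the hash check is what does.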