JerMa88 commited on
Commit
5daa05a
1 Parent(s): 8f11aec

llama-3.2-1B-personality-detection-N

Browse files
README.md CHANGED
@@ -50,14 +50,14 @@ The following `bitsandbytes` quantization config was used during training:
50
  ### Training hyperparameters
51
 
52
  The following hyperparameters were used during training:
53
- - learning_rate: 0.0002
54
  - train_batch_size: 4
55
  - eval_batch_size: 8
56
  - seed: 42
57
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
58
  - lr_scheduler_type: constant
59
  - lr_scheduler_warmup_ratio: 0.03
60
- - num_epochs: 50
61
  - mixed_precision_training: Native AMP
62
 
63
  ### Training results
 
50
  ### Training hyperparameters
51
 
52
  The following hyperparameters were used during training:
53
+ - learning_rate: 2e-05
54
  - train_batch_size: 4
55
  - eval_batch_size: 8
56
  - seed: 42
57
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
58
  - lr_scheduler_type: constant
59
  - lr_scheduler_warmup_ratio: 0.03
60
+ - num_epochs: 80
61
  - mixed_precision_training: Native AMP
62
 
63
  ### Training results
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:05083285f63a8cc0b4c0dbc20bca74f0b6bc2b9401b33ab50e895bc55fdf3e03
3
  size 27271552
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:902daab5ba88177ba25fb7015b0a2cc96a6539a0f693f48125be8c5bf0277469
3
  size 27271552
runs/Oct14_01-26-33_bcm-dgxa100-0006/events.out.tfevents.1728887207.bcm-dgxa100-0006 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b70c8d0faf4e6c5a199124c3219aa7c7cd96ca39f594308554744ae5d30e5175
3
+ size 189436
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ef8475e5837f8d308ac74e71ba29315c114f9b9327b25476e604f411ce9dbf88
3
  size 5496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9f5a66dcae9df73b52e3b001e9f3b1dd4e329f07ba99c1648666005cc0d8df88
3
  size 5496