JerMa88 committed on
Commit
2090935
1 Parent(s): 076f0cf

llama-3.2-1B-personality-detection-E

README.md CHANGED
@@ -50,14 +50,14 @@ The following `bitsandbytes` quantization config was used during training:
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 0.0002
+ - learning_rate: 2e-05
  - train_batch_size: 4
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: constant
  - lr_scheduler_warmup_ratio: 0.03
- - num_epochs: 50
+ - num_epochs: 80
  - mixed_precision_training: Native AMP
 
  ### Training results
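
The README hunk changes only two hyperparameters: the learning rate drops from 0.0002 to 2e-05 and the epoch count rises from 50 to 80. A minimal Python sketch of that change follows. The helper names `changed` and `to_trainer_kwargs` are hypothetical, and the mapping onto `transformers.TrainingArguments` keyword names is an assumption about how the card was generated, not something this commit states. "Native AMP" does not say whether fp16 or bf16 was used, so no precision flag is set here.

```python
# Values copied from the README diff; only two of them change in this commit.
BEFORE = {
    "learning_rate": 2e-4,
    "train_batch_size": 4,
    "eval_batch_size": 8,
    "seed": 42,
    "lr_scheduler_type": "constant",
    "lr_scheduler_warmup_ratio": 0.03,
    "num_epochs": 50,
}
AFTER = {**BEFORE, "learning_rate": 2e-5, "num_epochs": 80}


def changed(before, after):
    """Return {name: (old, new)} for every hyperparameter that differs."""
    return {k: (before[k], after[k]) for k in before if before[k] != after[k]}


def to_trainer_kwargs(hp):
    """Map README names to TrainingArguments keyword names (assumed mapping)."""
    return {
        "learning_rate": hp["learning_rate"],
        # Assumption: the card's batch sizes are per-device values.
        "per_device_train_batch_size": hp["train_batch_size"],
        "per_device_eval_batch_size": hp["eval_batch_size"],
        "seed": hp["seed"],
        "lr_scheduler_type": hp["lr_scheduler_type"],
        "warmup_ratio": hp["lr_scheduler_warmup_ratio"],
        "num_train_epochs": hp["num_epochs"],
        # Adam settings as listed in the README.
        "adam_beta1": 0.9,
        "adam_beta2": 0.999,
        "adam_epsilon": 1e-8,
    }


print(changed(BEFORE, AFTER))

# With transformers installed, the kwargs could be passed on like this
# ("out" is a placeholder output directory):
# from transformers import TrainingArguments
# args = TrainingArguments(output_dir="out", **to_trainer_kwargs(AFTER))
```

The new learning rate is ten times smaller than the old one, which may be why the run was extended to 80 epochs; the commit itself gives no rationale.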
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:50de793ed8f48e2323abfe043c216733727ca7887120ddcdbff25d8249ac3952
+ oid sha256:f620d0f59acc5b4e36f7de40f5125fc23449c60eb07043d65570d9acbd5dffcc
  size 27271552
runs/Oct14_01-26-34_bcm-dgxa100-0006/events.out.tfevents.1728887207.bcm-dgxa100-0006 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f5f599d64eeaf139bc3a7b095a4c6ca1881f22d089d03cabdae97d36c1b43d81
+ size 189436
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:001b1960dd5bc835eee2e2640bc4eafa1ed8f41339c2c76baf6caae1c2fe032a
+ oid sha256:2ba0fa51cf8e6f80d38a19547a12bc1765b050797d9c42c6864a35c829b29ad9
  size 5496
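
The binary files in this commit appear in the diff as Git LFS pointer files: three lines giving the spec version, a SHA-256 object id, and the size in bytes. As an illustrative sketch, the snippet below parses the old and new `training_args.bin` pointers from the hunk above and checks which fields changed. `parse_lfs_pointer` is a hypothetical helper written for this example, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into version, hash algorithm, digest, size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the training_args.bin hunk.
OLD = """version https://git-lfs.github.com/spec/v1
oid sha256:001b1960dd5bc835eee2e2640bc4eafa1ed8f41339c2c76baf6caae1c2fe032a
size 5496
"""
NEW = """version https://git-lfs.github.com/spec/v1
oid sha256:2ba0fa51cf8e6f80d38a19547a12bc1765b050797d9c42c6864a35c829b29ad9
size 5496
"""

old, new = parse_lfs_pointer(OLD), parse_lfs_pointer(NEW)
content_changed = old["digest"] != new["digest"]
size_changed = old["size"] != new["size"]
print(content_changed, size_changed)  # True False
```

A changed digest with an unchanged size means the file contents differ while the byte count is the same. The `adapter_model.safetensors` pointer shows the same pattern at 27271552 bytes.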