JerMa88 committed on
Commit 1d1d1ce
1 parent: 0500bb7

llama-3.2-1B-personality-detection-C

README.md CHANGED
@@ -50,14 +50,14 @@ The following `bitsandbytes` quantization config was used during training:
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 0.0002
+ - learning_rate: 2e-05
  - train_batch_size: 4
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: constant
  - lr_scheduler_warmup_ratio: 0.03
- - num_epochs: 50
+ - num_epochs: 80
  - mixed_precision_training: Native AMP

  ### Training results
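
The two changed lines in the README hunk above amount to a small configuration delta. As a minimal sketch, the snippet below records the listed values before and after this commit and reports which ones differ. The names `old_hparams`, `new_hparams`, and `changed_hparams` are hypothetical and are not part of the repository; the optimizer and mixed-precision entries are left out because they did not change.

```python
# Hyperparameter values copied from the README diff above.
# The dict and function names are illustrative only.
old_hparams = {
    "learning_rate": 2e-4,
    "train_batch_size": 4,
    "eval_batch_size": 8,
    "seed": 42,
    "lr_scheduler_type": "constant",
    "lr_scheduler_warmup_ratio": 0.03,
    "num_epochs": 50,
}

# The commit changes only the learning rate and the epoch count.
new_hparams = {**old_hparams, "learning_rate": 2e-5, "num_epochs": 80}


def changed_hparams(old, new):
    """Return {name: (old_value, new_value)} for every value that differs."""
    return {k: (old[k], new[k]) for k in old if old[k] != new[k]}


delta = changed_hparams(old_hparams, new_hparams)
for name, (before, after) in delta.items():
    print(f"{name}: {before} -> {after}")
```

The learning rate drops by a factor of ten while the epoch count rises from 50 to 80; every other listed setting is unchanged.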
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9854d8ca7b6b7e8d2c1299cffaafefea134217e00cf9a850c5f36a4ee5add6df
+ oid sha256:75486f1fe6f394aa2e69bf2b652fd8e7f306eeee000f67d9de274781728dedae
  size 27271552
runs/Oct14_01-26-34_bcm-dgxa100-0006/events.out.tfevents.1728887207.bcm-dgxa100-0006 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a1b8ab1bd76f7b75fc24799b177ff9d2ba0bb0a55daf11ac00b08b295429c4ae
+ size 188576
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:bc14b742c77bd897a07b8fa4ce0e7ab18395ce4e2c93cc88e1aca2ed144a9970
+ oid sha256:efbd37fe3e67e74cca883ceeb452c2cb4de76d81fb81567d28ae5a581e581acf
  size 5496
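
The binary files in this commit appear only as Git LFS pointers: three text lines giving the spec version, a SHA-256 object ID, and the size in bytes. When the size is unchanged (as for `adapter_model.safetensors` and `training_args.bin`), the differing `oid` is the only sign that the contents changed. A rough sketch of reading such a pointer and checking downloaded bytes against it follows; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict of its key/value lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Check raw bytes against the size and SHA-256 digest in a pointer."""
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer for the updated training_args.bin, copied from the diff above.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:efbd37fe3e67e74cca883ceeb452c2cb4de76d81fb81567d28ae5a581e581acf
size 5496
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```

To verify a local copy, read the file in binary mode and pass its bytes to `matches_pointer` together with the parsed pointer.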