markrodrigo
/

Llama-3.2-3B-Instruct-Spatial-SQL-1.0

Text Generation

Model card Files Files and versions Community

markrodrigo commited on Oct 6

Commit

87cf2f4

•

1 Parent(s): d98cee2

Update Hypers

Files changed (1) hide show

README.md +12 -0

README.md CHANGED Viewed

@@ -173,6 +173,18 @@ More information needed
 ### Training data
 Custom synthetic
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |

 ### Training data
 Custom synthetic
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 3e-05
+- train_batch_size: 10
+- eval_batch_size: 3
+- distributed_type: multi-GPU
+- num_devices: 2
+- optimizer: Adam 8bit
+- lr_scheduler_type: linear
+- num_epochs: 4
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |