Update README.md
Browse files
README.md
CHANGED
@@ -26,15 +26,15 @@ OrpoGemma-2-9B-TR is a Turkish fine-tuned version of [google/gemma-2-9b-it](http
|
|
26 |
- **Training Time**: 2.5 hours on NVIDIA H100
|
27 |
|
28 |
### QLoRA Configurations:
|
29 |
-
- `lora_r`:
|
30 |
- `lora_alpha`: 32
|
31 |
- `lora_dropout`: 0.05
|
32 |
|
33 |
### ORPO Training Parameters
|
34 |
- `lr`: 2e-6
|
35 |
- `epochs`: 3
|
36 |
-
- `
|
37 |
-
- `
|
38 |
|
39 |
## 📈 Training Curves
|
40 |
|
|
|
26 |
- **Training Time**: 2.5 hours on NVIDIA H100
|
27 |
|
28 |
### QLoRA Configurations:
|
29 |
+
- `lora_r`: 16
|
30 |
- `lora_alpha`: 32
|
31 |
- `lora_dropout`: 0.05
|
32 |
|
33 |
### ORPO Training Parameters
|
34 |
- `lr`: 2e-6
|
35 |
- `epochs`: 3
|
36 |
+
- `per_device_train_batch_size`: 8
|
37 |
+
- `gradient_accumulation_steps`: 4
|
38 |
|
39 |
## 📈 Training Curves
|
40 |
|