Commit d4a6e28 by benchang1110
Parent(s): 88773ae
Update README.md
README.md CHANGED
@@ -134,7 +134,7 @@ The following training hyperparameters are used in feature alignment and task sp
 
 | Data size | Global Batch Size | Learning Rate | Epochs | Max Length | Weight Decay |
 |--------------|-------------------|---------------|--------|------------|--------------|
-
+| 1M | 16 | 5e-5 | 1 | 2048 | 1e-5 |
 
 We use full-parameter finetuning for the projector and apply LoRA to the language model.
 
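
As a reading aid for the row added in this commit, the sketch below works out how many optimizer updates the listed settings imply. It is not taken from the repository: the helper name `optimizer_steps` is hypothetical, "1M" is assumed to mean exactly 1,000,000 samples, and a final partial batch is assumed to be kept rather than dropped.

```python
import math

# Values copied from the table row added in this commit.
DATA_SIZE = 1_000_000      # assumption: "1M" means exactly 1,000,000 samples
GLOBAL_BATCH_SIZE = 16
EPOCHS = 1
LEARNING_RATE = 5e-5
MAX_LENGTH = 2048
WEIGHT_DECAY = 1e-5


def optimizer_steps(data_size: int, global_batch_size: int, epochs: int) -> int:
    """Hypothetical helper: optimizer updates, keeping a final partial batch."""
    return math.ceil(data_size / global_batch_size) * epochs


total_steps = optimizer_steps(DATA_SIZE, GLOBAL_BATCH_SIZE, EPOCHS)

# Upper bound on tokens seen per update if every sample filled Max Length.
max_tokens_per_step = GLOBAL_BATCH_SIZE * MAX_LENGTH

print(total_steps)          # 62500
print(max_tokens_per_step)  # 32768
```

Under these assumptions, one epoch over the data corresponds to 62,500 updates; the actual count depends on how the training script rounds and on the true dataset size.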