Update README.md
Browse files
README.md
CHANGED
@@ -12,13 +12,13 @@ At the very core of the development of this model is the desire to make sure tha
|
|
12 |
|
13 |
Test this out and see if you find anything interesting or intriguing. I will keep iterating more versions but this one seems like a fun and useful way to start.
|
14 |
|
15 |
-
## Training
|
16 |
```
|
17 |
|
18 |
key: str = "system", key2: str = "instruction"
|
19 |
|
20 |
batch_size=1
|
21 |
-
epochs=10 (
|
22 |
r=16
|
23 |
lora_alpha=32
|
24 |
lora_dropout=0.001
|
|
|
12 |
|
13 |
Test this out and see if you find anything interesting or intriguing. I will keep iterating more versions but this one seems like a fun and useful way to start.
|
14 |
|
15 |
+
## Training (Done on the First Draft V1 of the dataset)
|
16 |
```
|
17 |
|
18 |
key: str = "system", key2: str = "instruction"
|
19 |
|
20 |
batch_size=1
|
21 |
+
epochs=10 (Don't do this for the current version of the dataset, your model WILL overfit. It's very potent.)
|
22 |
r=16
|
23 |
lora_alpha=32
|
24 |
lora_dropout=0.001
|