cyber-meow committed
Commit ef28c28 · 1 Parent(s): 4e83663
update readme
README.md CHANGED
@@ -53,6 +53,7 @@ However, I estimate a model of similar quality can be trained with fewer than 20
- The model was first trained for 18000 steps, at batch size 8, lr 1e-6, resolution 640, and conditional dropping rate of 15%.
- After this, I slightly modified the captions and trained the model for another 22000 steps, at batch size 8, lr 1e-6, resolution 704, and conditional dropping rate of 15%.
+ (Intermediate checkpoints can be found in the branch `all`)

Note that, because the weighting scheme translates into a different multiplier for each image, the repeat and epoch counts have a quite different meaning here.
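Since repeat and epoch counts are hard to interpret here, one way to read the two training stages is in terms of steps and samples seen. The following is a minimal sketch of that arithmetic, not the author's training configuration: the `Stage` dataclass and its field names are hypothetical, and it assumes that batch size 8 is the effective batch size (no gradient accumulation or multi-GPU scaling), which the README does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """Hypothetical container for one training stage described in the README."""
    steps: int
    batch_size: int        # assumed to be the effective batch size
    lr: float
    resolution: int
    cond_drop_rate: float  # conditional dropping rate, as a fraction


# Values copied from the two README bullets.
STAGES = [
    Stage(steps=18000, batch_size=8, lr=1e-6, resolution=640, cond_drop_rate=0.15),
    Stage(steps=22000, batch_size=8, lr=1e-6, resolution=704, cond_drop_rate=0.15),
]


def samples_seen(stage: Stage) -> int:
    """Number of (image, caption) samples drawn during one stage."""
    return stage.steps * stage.batch_size


total_steps = sum(s.steps for s in STAGES)
total_samples = sum(samples_seen(s) for s in STAGES)

print(f"total steps:   {total_steps}")    # 40000
print(f"total samples: {total_samples}")  # 320000
```

Under these assumptions the two stages add up to 40000 steps and 320000 samples drawn, regardless of how the per-image weighting distributes those draws across the dataset.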