alea31415
/

yama-no-susume

Model card Files Files and versions Community

cyber-meow commited on Dec 31, 2022

Commit

ef28c28

•

1 Parent(s): 4e83663

update readme

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -53,6 +53,7 @@ However, I estimate a model of similar quality can be trained with fewer than 20
 - The model was first trained for 18000 steps, at batch size 8, lr 1e-6, resolution 640, and conditional dropping rate of 15%.
 - After this, I modified a little the captions and trained the model for another 22000 steps, at batch size 8, lr 1e-6, reslution 704, and conditional dropping rate of 15%.
 Note that as a consequence of the weighting scheme which translates into a number of different multiply for each image,
 the count of repeat and epoch has a quite different meaning here.

 - The model was first trained for 18000 steps, at batch size 8, lr 1e-6, resolution 640, and conditional dropping rate of 15%.
 - After this, I modified a little the captions and trained the model for another 22000 steps, at batch size 8, lr 1e-6, reslution 704, and conditional dropping rate of 15%.
+(Intermediate checkpoints can be found in the branch `all`)
 Note that as a consequence of the weighting scheme which translates into a number of different multiply for each image,
 the count of repeat and epoch has a quite different meaning here.