alea31415
/

LyCORIS-experiments

Model card Files Files and versions Community

cyber-meow commited on Mar 27, 2023

Commit

08f4713

•

2 Parent(s): b3c8f62 7bdeb47

Merge branch 'main' of https://huggingface.co/alea31415/LyCORIS-experiments into main

Browse files

Files changed (1) hide show

README.md +17 -2

README.md CHANGED Viewed

@@ -37,12 +37,27 @@ For a thorough comparaison please refer to the `generated_samples` folder.
 Dataset, in general, is the most important out of all.
 The common wisdom that we should prune anything that we want to be attach to the trigger word is exactly the way to go for.
 No tags at all (top three rows) is terrible, especially for style training.
-Having all the tags (bottom three rows) remove the traits from subjects if these tags are not used during sampling (not completely true but more or less the case).
 ![00066-20230326090858](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples/00066-20230326090858.png)
-#### Training Resolution
 The most prominent benefit of training at higher resolution is that it helps generating more complex/detailed background.
 Chances are that you can get more details about the outfit or pupils etc.

 Dataset, in general, is the most important out of all.
 The common wisdom that we should prune anything that we want to be attach to the trigger word is exactly the way to go for.
 No tags at all (top three rows) is terrible, especially for style training.
+Having all the tags (bottom three rows) remove the traits from subjects if these tags are not used during sampling (not completely true but more or less the case, see also discussion below).
 ![00066-20230326090858](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples/00066-20230326090858.png)
+#### The effect of style images on characters
+I do beleive regularization images are important, far more important than tweaking any hyperparameters. They slow down training but also make sure that the undesired aspect are less baked into the model if we have images of other types, even if they are not for the subjects we train for.
+Comparing the models trained with and without style images, we can see that models trained with general style images have less anime styles baked in. The difference is particularly clear for Tilty, who only have anime screenshots for training.
+![00103-20230327084923](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples/00103-20230327084923.png)
+On the other hand, the default clothes seem to be better trained when there is no regularization image. While this may seem beneficial, it is worth noticing that I keep all the output tags. Therefore, in a sense we only want to get the outputs when we prompt them explicitly. The magic of having the trigger words to fill in what is not in caption seems to be more pronouncing when we have regularization images. In any case, this magic will not work forever as we will eventually start overfitting. The following image show that we get images that are much closer after putting clothes in prompts.
+![00105-20230327090703](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples/00105-20230327090703.png)
+In any case, if your regularization images are properly tagged with of a lot of concepts, then you always have the benefit that you can combine them with the main things you train for.
+#### Training resolution
 The most prominent benefit of training at higher resolution is that it helps generating more complex/detailed background.
 Chances are that you can get more details about the outfit or pupils etc.