Update README.md
For `0324_all_aniscreen_tags`, I accidentally tagged all the character images with `aniscreen`.
For the others, things are done correctly (anime screenshots tagged as `aniscreen`, fanart tagged as `fanart`).
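
As a side note, here is a minimal sketch of the intended tagging convention, assuming a dataset laid out with one folder per image source and one comma-separated `.txt` caption per image; the folder names and the helper itself are illustrative assumptions, not the exact script used:

```python
# Hypothetical layout: one subfolder per image source; each image has a .txt caption
# of comma-separated tags. The source tag (`aniscreen` or `fanart`) is prepended so
# screenshots and fanart can be distinguished at training time.
from pathlib import Path

SOURCE_TAGS = {"screenshots": "aniscreen", "fanart": "fanart"}  # assumed folder names

def add_source_tags(dataset_root: str) -> None:
    for folder, tag in SOURCE_TAGS.items():
        for caption_file in (Path(dataset_root) / folder).glob("*.txt"):
            tags = [t.strip() for t in caption_file.read_text().split(",") if t.strip()]
            if tag not in tags:
                caption_file.write_text(", ".join([tag] + tags))

add_source_tags("dataset/Anisphia")  # hypothetical path
```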

For reference, this is what each character looks like:

**Anisphia**
![Anisphia](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/groundtruth_images/Anisphia.png)

**Euphyllia**
![Euphyllia](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/groundtruth_images/Euphyllia.jpg)

**Tilty**
![Tilty](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/groundtruth_images/Tilty.jpeg)

**OyamaMahiro (white hair one) and OyamaMihari (black hair one)**
![OyamaMahiro+OyamaMihari](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/groundtruth_images/OyamaMahiro+OyamaMihari.jpg)

As for the styles, please check the artists' Pixiv pages yourself (note that there are R-18 images).

### Setting

Default settings are

However, some experiments concern the effect of tags, for which I regenerate the txt files; in this case the difference cannot be seen from the configuration file.
For now this concerns `05tag`, for which tags are only used with probability 0.5.
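
As a rough illustration of how such captions could be regenerated (a hedged sketch with assumed paths, not the exact script used), each tag of the original caption is kept independently with probability 0.5:

```python
# Sketch: regenerate captions where each tag is kept with probability 0.5.
# Reads comma-separated tags from *.txt files in `src_dir` and writes the
# subsampled captions to `dst_dir`.
import random
from pathlib import Path

def regenerate_05tag(src_dir: str, dst_dir: str, keep_prob: float = 0.5) -> None:
    out = Path(dst_dir)
    out.mkdir(parents=True, exist_ok=True)
    for caption_file in Path(src_dir).glob("*.txt"):
        tags = [t.strip() for t in caption_file.read_text().split(",") if t.strip()]
        kept = [t for t in tags if random.random() < keep_prob]
        (out / caption_file.name).write_text(", ".join(kept))

regenerate_05tag("captions_full", "captions_05tag")  # hypothetical directories
```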

### Some observations

For a thorough comparison, please refer to the `generated_samples` folder.

Lykon did show some successful results by only training with anime images on NED.
Clearly, what really matters is how the model is made, and not what the model looks like. A model that is versatile in style is not necessarily a good base model for whatever kind of training. In fact, VBP2-2 has around 300 styles trained in, but a LoHa trained on top of it does not transfer well to other models.
Similarly, the fact that two models produce a similar style does not mean they transfer well to each other. Both MFB and Salt-Mix have a strong anime screenshot style, but a LoHa trained on MFB does not transfer well to Salt-Mix.

**A Case Study on Customized Merge Models**

To understand whether you can train a style to be used on a group of models by simply merging these models, I pick a few models and merge them myself to see if this is really effective. In particular, I choose models that are far from each other, and consider both average and add difference merges. Here are the two recipes that I use.
```
# Recipe for average merge
tmp1 = nai-full-pruned + bp_nman_e29, 0.5, fp16, ckpt
tmp2 = __O1__ + nep, 0.333, fp16, ckpt
tmp3 = __O2__ + Pastel-Mix, 0.25, fp16, ckpt
tmp4 = __O3__ + fantasyBackground_v10PrunedFp16, 0.2, fp16, ckpt
tmp5 = __O4__ + MyneFactoryBase_V1.0, 0.166, fp16, ckpt
AleaMix = __O5__ + anylora_FTMSE, 0.142, fp16, ckpt
```

```
# Recipe for add difference merge
tmp1 = nai-full-pruned + bp_nman_e29, 0.5, fp16, ckpt
tmp2-ad = __O1__ + nep + nai-full-pruned, 0.5, fp16, safetensors
tmp3-ad = __O2__ + Pastel-Mix + nai-full-pruned, 0.5, fp16, safetensors
tmp4-ad = __O3__ + fantasyBackground_v10PrunedFp16 + nai-full-pruned, 0.5, fp16, safetensors
tmp5-ad = __O4__ + MyneFactoryBase_V1.0 + nai-full-pruned, 0.5, fp16, safetensors
AleaMix-ad = __O5__ + anylora_FTMSE + nai-full-pruned, 0.5, fp16, safetensors
```
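
Note that the successive ratios 0.5, 0.333, 0.25, 0.2, 0.166, 0.142 are roughly 1/2, 1/3, 1/4, 1/5, 1/6, 1/7, so AleaMix ends up as approximately an equal-weight average of the seven models. Assuming the recipes follow the usual checkpoint-merger conventions (weighted sum `A * (1 - M) + B * M`; add difference `A + (B - C) * M`), the sketch below shows roughly what each recipe line computes on raw state dicts; it is an illustration under those assumptions, not the merging tool actually used.

```python
# Sketch of the two merge operations over checkpoint state dicts (the standard
# weighted-sum and add-difference formulas are assumed; fp16 casting and key
# mismatches between checkpoints are ignored for brevity).

def weighted_sum(a: dict, b: dict, m: float) -> dict:
    # A * (1 - m) + B * m
    return {k: a[k] * (1 - m) + b[k] * m for k in a}

def add_difference(a: dict, b: dict, c: dict, m: float) -> dict:
    # A + (B - C) * m; in the recipes above, C is always nai-full-pruned
    return {k: a[k] + (b[k] - c[k]) * m for k in a}

# First steps of the average recipe (state dicts loaded e.g. with torch.load):
# tmp1 = weighted_sum(nai_full_pruned, bp_nman_e29, 0.5)    -> 1/2 each
# tmp2 = weighted_sum(tmp1, nep, 0.333)                      -> ~1/3 each
# ...continuing with 0.25, 0.2, 0.166, 0.142 ends at ~1/7 weight per model.
```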

I then train on top of tmp3, AleaMix, tmp3-ad, and AleaMix-ad. It turns out that these models are too different, so it does not work very well. Getting style transfer to PastelMix and FantasyBackground is quite difficult. I nevertheless observe the following.

- We generally get bad results when applying the LoHa to NAI. This is in line with previous experiments.
- We get better transfer to NMFSAN compared to most of the previous LoHas that are not trained on the BP family.
- Add difference with too many models (7) at a high weight (0.5) blows the model up: you can still train on it and get reasonable results, but it does not transfer to the individual components.
- Add difference with a smaller number of models (4) can work. It sometimes seems to be more effective than a simple average (note how the model trained on tmp3-ad manages to cancel out the style of nep and PastelMix in the examples below).

![xyz_grid-0000-20230330204940](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0000-20230330204940.jpg)
![xyz_grid-0008-20230330221018](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0008-20230330221018.jpg)
![xyz_grid-0009-20230330222021](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0009-20230330222021.jpg)
![xyz_grid-0005-20230330212715](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0005-20230330212715.jpg)
![xyz_grid-0004-20230330211628](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0004-20230330211628.jpg)

*An interesting observation*

While the model AleaMix-ad is barely usable, the LoHa trained on it produces very strong styles and excellent details.

Results on AleaMix (the weighted sum version)
![xyz_grid-0011-20230330224054](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0011-20230330224054.jpg)

Results on AleaMix-ad (the add difference version)
![xyz_grid-0012-20230330224058](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0012-20230330224058.jpg)

However, you may also need to worry about bad hands with such a model.
![00032-20230330225216](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/00032-20230330225216.png)

#### Training Speed

It has also been suggested that you train faster on AnyLora. I try to look into this in several ways, but I do not see a clear difference.
Note that we should mostly focus on the diagonal (each LoHa applied to the model it was trained on).

First, I use the 6000-step checkpoints for characters.
![xyz_grid-0007-20230330035309](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples_0329/xyz_grid-0007-20230330035309.jpg)