markury commited on
Commit
ed6d958
1 Parent(s): ee482ba

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +166 -0
README.md ADDED
@@ -0,0 +1,166 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: "stabilityai/stable-diffusion-3.5-large"
4
+ tags:
5
+ - sd3
6
+ - sd3-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - simpletuner
10
+ - safe-for-work
11
+ - lora
12
+ - template:sd-lora
13
+ - lycoris
14
+ inference: true
15
+ widget:
16
+ - text: 'unconditional (blank prompt)'
17
+ parameters:
18
+ negative_prompt: 'blurry, cropped, ugly'
19
+ output:
20
+ url: ./assets/image_0_0.png
21
+ - text: 'A scene from the animated Studio Ghibli movie Spirited Away, where a man with a violin serenades the night on a bridge while paper lanterns float in the river below.'
22
+ parameters:
23
+ negative_prompt: 'blurry, cropped, ugly'
24
+ output:
25
+ url: ./assets/image_1_0.png
26
+ - text: 'A scene from the animated Studio Ghibli movie Spirited Away, where a small child with a red scarf wanders through a bustling market filled with strange and colorful creatures.'
27
+ parameters:
28
+ negative_prompt: 'blurry, cropped, ugly'
29
+ output:
30
+ url: ./assets/image_2_0.png
31
+ - text: 'A scene from the animated Studio Ghibli movie Spirited Away, featuring a fluffy white cat napping on a train seat as shadows of passengers glide by.'
32
+ parameters:
33
+ negative_prompt: 'blurry, cropped, ugly'
34
+ output:
35
+ url: ./assets/image_3_0.png
36
+ - text: 'A scene from the animated Studio Ghibli movie Spirited Away, where hundreds of lanterns rise into the starry sky, each carrying a wish from unseen characters.'
37
+ parameters:
38
+ negative_prompt: 'blurry, cropped, ugly'
39
+ output:
40
+ url: ./assets/image_4_0.png
41
+ - text: 'A scene from the animated Studio Ghibli movie Spirited Away, where a spirit with glowing eyes floats between the shelves of an endless library, reading ancient books aloud.'
42
+ parameters:
43
+ negative_prompt: 'blurry, cropped, ugly'
44
+ output:
45
+ url: ./assets/image_5_0.png
46
+ - text: 'A photo-realistic image of a cat'
47
+ parameters:
48
+ negative_prompt: 'blurry, cropped, ugly'
49
+ output:
50
+ url: ./assets/image_6_0.png
51
+ ---
52
+
53
+ # sd35-spirited-away-lokr
54
+
55
+ This is a LyCORIS adapter derived from [stabilityai/stable-diffusion-3.5-large](https://huggingface.co/stabilityai/stable-diffusion-3.5-large).
56
+
57
+
58
+ The main validation prompt used during training was:
59
+
60
+
61
+
62
+ ```
63
+ A photo-realistic image of a cat
64
+ ```
65
+
66
+ ## Validation settings
67
+ - CFG: `4.0`
68
+ - CFG Rescale: `0.0`
69
+ - Steps: `20`
70
+ - Sampler: `None`
71
+ - Seed: `42`
72
+ - Resolution: `1024x1024`
73
+
74
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
75
+
76
+ You can find some example images in the following gallery:
77
+
78
+
79
+ <Gallery />
80
+
81
+ The text encoder **was not** trained.
82
+ You may reuse the base model text encoder for inference.
83
+
84
+
85
+ ## Training settings
86
+
87
+ - Training epochs: 0
88
+ - Training steps: 100
89
+ - Learning rate: 1e-05
90
+ - Max grad norm: 0.01
91
+ - Effective batch size: 8
92
+ - Micro-batch size: 8
93
+ - Gradient accumulation steps: 1
94
+ - Number of GPUs: 1
95
+ - Prediction type: flow-matching
96
+ - Rescaled betas zero SNR: False
97
+ - Optimizer: adamw_bf16
98
+ - Precision: Pure BF16
99
+ - Quantised: No
100
+ - Xformers: Not used
101
+ - LyCORIS Config:
102
+ ```json
103
+ {
104
+ "algo": "lokr",
105
+ "multiplier": 1.0,
106
+ "linear_dim": 10000,
107
+ "linear_alpha": 1,
108
+ "factor": 12,
109
+ "apply_preset": {
110
+ "target_module": [
111
+ "Attention"
112
+ ],
113
+ "module_algo_map": {
114
+ "Attention": {
115
+ "factor": 12
116
+ },
117
+ "FeedForward": {
118
+ "factor": 6
119
+ }
120
+ }
121
+ }
122
+ }
123
+ ```
124
+
125
+ ## Datasets
126
+
127
+ ### screencaps-1024
128
+ - Repeats: 10
129
+ - Total number of images: 379
130
+ - Total number of aspect buckets: 1
131
+ - Resolution: 1.048576 megapixels
132
+ - Cropped: False
133
+ - Crop style: None
134
+ - Crop aspect: None
135
+ - Used for regularisation data: No
136
+
137
+
138
+ ## Inference
139
+
140
+
141
+ ```python
142
+ import torch
143
+ from diffusers import DiffusionPipeline
144
+ from lycoris import create_lycoris_from_weights
145
+
146
+ model_id = 'stabilityai/stable-diffusion-3.5-large'
147
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
148
+ lora_scale = 1.0
149
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
150
+ wrapper.merge_to()
151
+
152
+ prompt = "A photo-realistic image of a cat"
153
+ negative_prompt = 'blurry, cropped, ugly'
154
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
155
+ image = pipeline(
156
+ prompt=prompt,
157
+ negative_prompt=negative_prompt,
158
+ num_inference_steps=20,
159
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
160
+ width=1024,
161
+ height=1024,
162
+ guidance_scale=4.0,
163
+ ).images[0]
164
+ image.save("output.png", format="PNG")
165
+ ```
166
+