davidrd123 committed on
Commit
70076f8
1 Parent(s): 4cbe63e

Model card auto-generated by SimpleTuner

  1. README.md +209 -0
README.md ADDED
@@ -0,0 +1,209 @@
+ ---
+ license: creativeml-openrail-m
+ base_model: "stabilityai/stable-diffusion-xl-base-1.0"
+ tags:
+   - sdxl
+   - sdxl-diffusers
+   - text-to-image
+   - diffusers
+   - simpletuner
+   - safe-for-work
+   - lora
+   - template:sd-lora
+   - standard
+ inference: true
+ widget:
+ - text: 'unconditional (blank prompt)'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_0_0.png
+ - text: 'hshge, Mount Fuji viewed from a distance, with cherry blossoms in the foreground. A small village nestles at the base of the mountain.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_1_0.png
+ - text: 'hshge, Hamster'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_2_0.png
+ - text: 'hshge, A scene from the Tokaido road, with travelers crossing a wooden bridge. A misty mountain landscape in the background.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_3_0.png
+ - text: 'hshge, A busy fish market in Edo. Vendors display their catch while customers browse. Boats visible in the nearby harbor.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_4_0.png
+ - text: 'hshge, People caught in a sudden rainstorm on a city street, rushing for cover with umbrellas. A large bridge spans the background.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_5_0.png
+ - text: 'hshge, A serene temple complex under a full moon. Lanterns illuminate the path, with silhouettes of pine trees against the night sky.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_6_0.png
+ - text: 'hshge, A traditional Japanese garden in winter. Snow-covered trees and a small bridge over a frozen pond. A figure in a kimono walks along a path.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_7_0.png
+ - text: 'hshge, The modern Tokyo Skytree towering over traditional low-rise buildings. Cherry blossoms frame the view.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_8_0.png
+ - text: 'hshge, A sleek bullet train speeding past Mount Fuji. Rice fields and a small town visible in the middle ground.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_9_0.png
+ - text: 'hshge, The bustling Times Square in New York, with bright billboards and crowds of people. A view reminiscent of Hiroshige''s busy street scenes.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_10_0.png
+ - text: 'hshge, A futuristic Mars colony with dome habitats and space vehicles. The red Martian landscape stretches to the horizon.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_11_0.png
+ - text: 'hshge, An imaginary underwater city with Japanese-style architecture. Fish and sea creatures swim among the buildings.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_12_0.png
+ - text: 'hshge, People wearing VR headsets in a modern cafe. Traditional Japanese elements mix with futuristic technology in the decor.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_13_0.png
+ - text: 'hshge, hamster'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_14_0.png
+ ---
+
+ # Hiroshige-SDXL-LoRA
+
+ This is a standard PEFT LoRA derived from [stabilityai/stable-diffusion-xl-base-1.0](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0).
+
+ The main validation prompt used during training was:
+
+ ```
+ hshge, hamster
+ ```
+
+ ## Validation settings
+ - CFG: `4.2`
+ - CFG Rescale: `0.0`
+ - Steps: `20`
+ - Sampler: `None`
+ - Seed: `42`
+ - Resolution: `1024x1024`
+
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
+
+ You can find some example images in the following gallery:
+
+ <Gallery />
+
+ The text encoder **was not** trained.
+ You may reuse the base model text encoder for inference.
+
+ ## Training settings
+
+ - Training epochs: 0
+ - Training steps: 200
+ - Learning rate: 8e-05
+ - Effective batch size: 8
+ - Micro-batch size: 8
+ - Gradient accumulation steps: 1
+ - Number of GPUs: 1
+ - Prediction type: epsilon
+ - Rescaled betas zero SNR: False
+ - Optimizer: adamw_bf16
+ - Precision: Pure BF16
+ - Quantised: Yes: int8-quanto
+ - Xformers: Not used
+ - LoRA Rank: 64
+ - LoRA Alpha: None
+ - LoRA Dropout: 0.1
+ - LoRA initialisation style: default
+
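The batch-size figures in the training settings are related by simple arithmetic. As a minimal sketch (the variable names are illustrative and are not SimpleTuner configuration keys), the effective batch size is the product of the micro-batch size, the gradient accumulation steps, and the number of GPUs; multiplying that by the step count gives the number of samples processed during training:

```python
# Values copied from the training settings list; names are illustrative only.
micro_batch_size = 8
gradient_accumulation_steps = 1
num_gpus = 1
training_steps = 200

# Samples contributing to each optimizer update.
effective_batch_size = micro_batch_size * gradient_accumulation_steps * num_gpus

# Total samples processed over the whole run.
samples_seen = effective_batch_size * training_steps

print(effective_batch_size)  # 8
print(samples_seen)          # 1600
```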
+ ## Datasets
+
+ ### hiroshige-sdxl-512
+ - Repeats: 10
+ - Total number of images: 219
+ - Total number of aspect buckets: 6
+ - Resolution: 0.262144 megapixels
+ - Cropped: False
+ - Crop style: None
+ - Crop aspect: None
+ ### hiroshige-sdxl-1024
+ - Repeats: 10
+ - Total number of images: 219
+ - Total number of aspect buckets: 6
+ - Resolution: 1.048576 megapixels
+ - Cropped: False
+ - Crop style: None
+ - Crop aspect: None
+ ### hiroshige-sdxl-512-crop
+ - Repeats: 10
+ - Total number of images: 219
+ - Total number of aspect buckets: 1
+ - Resolution: 0.262144 megapixels
+ - Cropped: True
+ - Crop style: random
+ - Crop aspect: square
+ ### hiroshige-sdxl-1024-crop
+ - Repeats: 10
+ - Total number of images: 219
+ - Total number of aspect buckets: 1
+ - Resolution: 1.048576 megapixels
+ - Cropped: True
+ - Crop style: random
+ - Crop aspect: square
+
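The dataset resolutions are listed as pixel areas in megapixels rather than as width and height. A quick check, assuming a square image for illustration (the `side_for_megapixels` helper is hypothetical and not part of SimpleTuner), suggests that 0.262144 megapixels corresponds to 512x512 and 1.048576 megapixels to 1024x1024, which matches the dataset names:

```python
import math


def side_for_megapixels(megapixels: float) -> int:
    """Return the side length of a square image with the given area in megapixels."""
    return round(math.sqrt(megapixels * 1_000_000))


print(side_for_megapixels(0.262144))  # 512
print(side_for_megapixels(1.048576))  # 1024
```

For the uncropped datasets, which report 6 aspect buckets, images are presumably kept near this pixel area at other width-to-height ratios rather than forced to a square.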
+ ## Inference
+
+ ```python
+ import torch
+ from diffusers import DiffusionPipeline
+
+ model_id = 'stabilityai/stable-diffusion-xl-base-1.0'
+ adapter_id = 'davidrd123/Hiroshige-SDXL-LoRA'
+
+ # Load the SDXL base pipeline, then attach the LoRA adapter weights.
+ pipeline = DiffusionPipeline.from_pretrained(model_id)
+ pipeline.load_lora_weights(adapter_id)
+
+ prompt = "hshge, hamster"
+ negative_prompt = 'blurry, cropped, ugly'
+
+ # Pick the device once and reuse it for both the pipeline and the generator.
+ device = 'cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu'
+ pipeline.to(device)
+
+ image = pipeline(
+     prompt=prompt,
+     negative_prompt=negative_prompt,
+     num_inference_steps=20,
+     generator=torch.Generator(device=device).manual_seed(1641421826),
+     width=1024,
+     height=1024,
+     guidance_scale=4.2,
+     guidance_rescale=0.0,
+ ).images[0]
+ image.save("output.png", format="PNG")
+ ```
+