metadata

license: other
base_model: stabilityai/stable-diffusion-3.5-medium
tags:
  - sd3
  - sd3-diffusers
  - text-to-image
  - diffusers
  - simpletuner
  - not-for-all-audiences
  - lora
  - template:sd-lora
  - lycoris
inference: true
widget:
  - text: unconditional (blank prompt)
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_0_0.png
  - text: a picture of tommy chong
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_1_0.png
  - text: young tommy chong
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_2_0.png
  - text: >-
      a stoic photograph of tommy chong. he looks off into the distance,
      standing up against the railing of a ship. the sky is cloudy.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_3_0.png
  - text: an elderly tommy chong as a contestant on Wheel of Fortune
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_4_0.png
  - text: >-
      tommy chong as a superhero in the style of studio ghibli. he wears a metal
      armor suit with glowing lights and power indicators.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_5_0.png
  - text: >-
      tommy chong in a casket, dead. he is dead and it is a funeral. the text
      overhead says 'HE HAS NOT RISEN'.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_6_0.png
  - text: a picture of cheech marin
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_7_0.png
  - text: young cheech marin
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_8_0.png
  - text: >-
      a stoic photograph of cheech marin. he looks off into the distance,
      standing up against the railing of a ship. the sky is cloudy.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_9_0.png
  - text: an elderly cheech marin as a contestant on Wheel of Fortune
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_10_0.png
  - text: >-
      cheech marin as a superhero in the style of studio ghibli. he wears a
      metal armor suit with glowing lights and power indicators.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_11_0.png
  - text: >-
      cheech marin in a casket, dead. he is dead and it is a funeral. the text
      overhead says 'HE HAS NOT RISEN'.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_12_0.png
  - text: >-
      cheech marin sitting to the left of tommy chong on the set of a television
      interview
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_13_0.png
  - text: >-
      cheech marin sitting to the right of tommy chong on the set of a
      television interview
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_14_0.png
  - text: >-
      cheech and chong sitting together on the stoop of a new york apartment
      building, 1972
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_15_0.png
  - text: >-
      the iconic duo cheech and chong on stage performing stand-up comedy
      together in 2008
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_16_0.png
  - text: A photo-realistic image of a cat
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_17_0.png

sd35m-photo-mixedres-rGN-sS3

This is a LyCORIS adapter derived from stabilityai/stable-diffusion-3.5-medium.

The main validation prompt used during training was:

A photo-realistic image of a cat

Validation settings

CFG: 3.0
CFG Rescale: 0.0
Steps: 20
Sampler: None
Seed: 42
Resolution: 1024x1024

Note: The validation settings are not necessarily the same as the training settings.
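
For readers unfamiliar with the CFG value listed above, the following is a minimal sketch of the standard classifier-free guidance combination, written with plain Python lists rather than tensors. The function name `apply_cfg` and the toy values are illustrative assumptions and are not taken from the pipeline code.

```python
def apply_cfg(uncond, cond, guidance_scale):
    """Combine unconditional and conditional predictions element-wise.

    Standard classifier-free guidance: start from the unconditional
    prediction and move toward the conditional one by `guidance_scale`.
    """
    return [u + guidance_scale * (c - u) for u, c in zip(uncond, cond)]


# Toy predictions (hypothetical values, for illustration only).
uncond_pred = [0.0, 1.0]
cond_pred = [1.0, 1.0]

# CFG: 3.0 as listed in the validation settings.
guided = apply_cfg(uncond_pred, cond_pred, 3.0)
print(guided)
```

With a scale of 1.0 the result equals the conditional prediction; larger values push the output further toward the prompt and away from the negative prompt.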

You can find some example images in the following gallery:

Prompt
unconditional (blank prompt)

Negative Prompt
blurry, cropped, ugly

Prompt
Alien planet, strange rock formations, glowing plants, bizarre creatures, surreal atmosphere

Negative Prompt
blurry, cropped, ugly

Prompt
Alien marketplace, bizarre creatures, exotic goods, vibrant colors, otherworldly atmosphere

Negative Prompt
blurry, cropped, ugly

Prompt
Child holding a balloon, happy expression, colorful balloons, sunny day, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
a 4-panel comic strip showing an orange cat saying the words 'HELP' and 'LASAGNA'

Negative Prompt
blurry, cropped, ugly

Prompt
a hand is holding a comic book with a cover that reads 'The Adventures of Superhero'

Negative Prompt
blurry, cropped, ugly

Prompt
Underground cave filled with crystals, glowing lights, reflective surfaces, fantasy environment, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Bustling cyberpunk bazaar, vendors, neon signs, advanced tech, crowded, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Cyberpunk hacker in a dark room, neon glow, multiple screens, intense focus, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
a cybernetic anne of green gables with neural implant and bio mech augmentations

Negative Prompt
blurry, cropped, ugly

Prompt
Post-apocalyptic cityscape, ruined buildings, overgrown vegetation, dark and gritty, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Magical castle in a lush forest, glowing windows, fantasy architecture, high resolution, detailed textures

Negative Prompt
blurry, cropped, ugly

Prompt
Ruins of an ancient temple in an enchanted forest, glowing runes, mystical creatures, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Mystical forest, glowing plants, fairies, magical creatures, fantasy art, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Magical garden with glowing flowers, fairies, serene atmosphere, detailed plants, high resolution

Negative Prompt
blurry, cropped, ugly

Prompt
Whimsical garden filled with fairies, magical plants, sparkling lights, serene atmosphere, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Majestic dragon soaring through the sky, detailed scales, dynamic pose, fantasy art, high resolution

Negative Prompt
blurry, cropped, ugly

Prompt
Fantasy world, floating islands in the sky, waterfalls, lush vegetation, detailed landscape, high resolution

Negative Prompt
blurry, cropped, ugly

Prompt
Futuristic city skyline at night, neon lights, cyberpunk style, high contrast, sharp focus

Negative Prompt
blurry, cropped, ugly

Prompt
Space battle scene, starships fighting, laser beams, explosions, cosmic background

Negative Prompt
blurry, cropped, ugly

Prompt
Abandoned fairground at night, eerie rides, ghostly figures, fog, dark atmosphere, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Spooky haunted mansion on a hill, dark and eerie, glowing windows, ghostly atmosphere, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
a hardcover physics textbook that is called PHYSICS FOR DUMMIES

Negative Prompt
blurry, cropped, ugly

Prompt
Epic medieval battle, knights in armor, dynamic action, detailed landscape, high resolution

Negative Prompt
blurry, cropped, ugly

Prompt
Bustling medieval market with merchants, knights, and jesters, vibrant colors, detailed

Negative Prompt
blurry, cropped, ugly

Prompt
Cozy medieval tavern, warm firelight, adventurers drinking, detailed interior, rustic atmosphere

Negative Prompt
blurry, cropped, ugly

Prompt
Forest with neon-lit trees, glowing plants, bioluminescence, surreal atmosphere, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Bright neon sign in a busy city street, 'Open 24 Hours', bold typography, glowing lights

Negative Prompt
blurry, cropped, ugly

Prompt
Vibrant neon sign, 'Bar', bold typography, dark background, glowing lights, detailed design

Negative Prompt
blurry, cropped, ugly

Prompt
Pirate ship on the high seas, stormy weather, detailed sails, dramatic waves, photorealistic

Negative Prompt
blurry, cropped, ugly

Prompt
Pirate discovering a treasure chest, detailed gold coins, tropical island, dramatic lighting

Negative Prompt
blurry, cropped, ugly

Prompt
a photograph of a woman experiencing a psychedelic trip. trippy, 8k, uhd, fractal

Negative Prompt
blurry, cropped, ugly

Prompt
Cozy cafe on a rainy day, people sipping coffee, warm lights, reflections on wet pavement, photorealistic

Negative Prompt
blurry, cropped, ugly

Prompt
1980s arcade, neon lights, vintage game machines, kids playing, vibrant colors, nostalgic atmosphere

Negative Prompt
blurry, cropped, ugly

Prompt
1980s game room with vintage arcade machines, neon lights, vibrant colors, nostalgic feel

Negative Prompt
blurry, cropped, ugly

Prompt
Robot blacksmith forging metal, sparks flying, detailed workshop, futuristic and medieval blend

Negative Prompt
blurry, cropped, ugly

Prompt
Sleek robot performing a dance, futuristic theater, holographic effects, detailed, high resolution

Negative Prompt
blurry, cropped, ugly

Prompt
High-tech factory where robots are assembled, detailed machinery, futuristic setting, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Garden tended by robots, mechanical plants, colorful flowers, futuristic setting, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Cute robotic pet, futuristic home, sleek design, detailed features, friendly and animated

Negative Prompt
blurry, cropped, ugly

Prompt
cctv trail camera night time security picture of a wendigo in the woods

Negative Prompt
blurry, cropped, ugly

Prompt
Astronaut exploring an alien planet, detailed landscape, futuristic suit, cosmic background

Negative Prompt
blurry, cropped, ugly

Prompt
Futuristic space station orbiting a distant exoplanet, sleek design, detailed structures, cosmic backdrop

Negative Prompt
blurry, cropped, ugly

Prompt
a person holding a sign that reads 'SOON'

Negative Prompt
blurry, cropped, ugly

Prompt
Steampunk airship in the sky, intricate design, Victorian aesthetics, dynamic scene, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Steampunk inventor in a workshop, intricate gadgets, Victorian attire, mechanical arm, goggles

Negative Prompt
blurry, cropped, ugly

Prompt
Stormy ocean with towering waves, dramatic skies, detailed water, intense atmosphere, high resolution

Negative Prompt
blurry, cropped, ugly

Prompt
Dramatic stormy sea, lighthouse in the distance, lightning striking, dark clouds, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Graffiti artist creating a mural, vibrant colors, urban setting, dynamic action, high resolution

Negative Prompt
blurry, cropped, ugly

Prompt
Urban alleyway filled with vibrant graffiti art, tags and murals, realistic textures

Negative Prompt
blurry, cropped, ugly

Prompt
Urban street sign, 'Main Street', bold typography, realistic textures, weathered look

Negative Prompt
blurry, cropped, ugly

Prompt
Classic car show with vintage vehicles, vibrant colors, nostalgic atmosphere, high detail

Negative Prompt
blurry, cropped, ugly

Prompt
Retro diner sign, 'Joe's Diner', classic 1950s design, neon lights, weathered look

Negative Prompt
blurry, cropped, ugly

Prompt
Vintage store sign with elaborate typography, 'Antique Shop', hand-painted, weathered look

Negative Prompt
blurry, cropped, ugly

Prompt
A photo-realistic image of a cat

Negative Prompt
blurry, cropped, ugly

The text encoders were not trained. You can reuse the base model's text encoders for inference.

Training settings

Training epochs: 1
Training steps: 14000
Learning rate: 0.0001
Max grad norm: 0.01
Effective batch size: 12
- Micro-batch size: 4
- Gradient accumulation steps: 1
- Number of GPUs: 3
Prediction type: flow-matching
Rescaled betas zero SNR: False
Optimizer: adamw_bf16
Precision: Pure BF16
Quantised: No
Xformers: Not used
LyCORIS Config:

{
    "bypass_mode": true,
    "algo": "lokr",
    "multiplier": 1.0,
    "full_matrix": true,
    "linear_dim": 10000,
    "linear_alpha": 1,
    "factor": 4,
    "apply_preset": {
        "target_module": [
            "Attention",
            "FeedForward"
        ],
        "module_algo_map": {
            "FeedForward": {
                "factor": 4
            },
            "JointTransformerBlock": {
                "factor": 2
            }
        }
    }
}
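
To give a feel for what the "lokr" algorithm and the "factor" setting do, here is a rough sketch of a Kronecker-product weight update and its parameter count. It assumes the layer dimensions divide evenly by the factor and uses 1536 as an illustrative layer width; the helper names `kron` and `lokr_param_count` are hypothetical and this is not the LyCORIS implementation.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [
        [a_ij * b_kl for a_ij in a_row for b_kl in b_row]
        for a_row in a
        for b_row in b
    ]


def lokr_param_count(out_dim, in_dim, factor):
    """Parameters in a full-matrix LoKr update for one linear layer.

    ASSUMPTION: out_dim and in_dim are divisible by factor, so the update
    is kron(W1, W2) with W1 of shape (factor, factor) and W2 of shape
    (out_dim // factor, in_dim // factor).
    """
    w1_params = factor * factor
    w2_params = (out_dim // factor) * (in_dim // factor)
    return w1_params + w2_params


# A 2x2 Kronecker product of two 2x2 matrices yields a 4x4 update.
delta = kron([[1, 2], [3, 4]], [[0, 1], [1, 0]])
print(len(delta), len(delta[0]))

# Illustrative 1536x1536 layer with factor 4, as in the config above.
full_params = 1536 * 1536
adapter_params = lokr_param_count(1536, 1536, 4)
print(adapter_params, full_params)
```

Under these assumptions, a smaller factor (such as the 2 mapped to JointTransformerBlock) leaves a larger second matrix and therefore more trainable parameters per layer.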

Datasets

signs

Repeats: 0
Total number of images: ~420
Total number of aspect buckets: 20
Resolution: 1.048576 megapixels
Cropped: False
Crop style: None
Crop aspect: None
Used for regularisation data: No

moviecollection

Repeats: 0
Total number of images: ~1983
Total number of aspect buckets: 7
Resolution: 1.048576 megapixels
Cropped: False
Crop style: None
Crop aspect: None
Used for regularisation data: No

bookcovers

Repeats: 0
Total number of images: ~927
Total number of aspect buckets: 26
Resolution: 1.048576 megapixels
Cropped: False
Crop style: None
Crop aspect: None
Used for regularisation data: No

shutterstock

Repeats: 0
Total number of images: ~21111
Total number of aspect buckets: 5
Resolution: 1.048576 megapixels
Cropped: False
Crop style: None
Crop aspect: None
Used for regularisation data: No

cinemamix-1mp

Repeats: 0
Total number of images: ~7425
Total number of aspect buckets: 1
Resolution: 1.048576 megapixels
Cropped: False
Crop style: None
Crop aspect: None
Used for regularisation data: No

anatomy

Repeats: 5
Total number of images: ~16440
Total number of aspect buckets: 3
Resolution: 1.048576 megapixels
Cropped: False
Crop style: None
Crop aspect: None
Used for regularisation data: No
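
The training settings and dataset sizes above can be cross-checked with some quick arithmetic. The sketch below recomputes the effective batch size and estimates how many passes over the data 14000 steps represent. It assumes that a repeats value of N means the dataset is seen N additional times per epoch, and the image counts are the approximate figures listed above, so the result is only an estimate.

```python
# Values copied from the training settings above.
micro_batch_size = 4
gradient_accumulation_steps = 1
num_gpus = 3
training_steps = 14000

effective_batch_size = micro_batch_size * gradient_accumulation_steps * num_gpus
samples_seen = training_steps * effective_batch_size

# (approximate image count, repeats) per dataset, from the list above.
datasets = {
    "signs": (420, 0),
    "moviecollection": (1983, 0),
    "bookcovers": (927, 0),
    "shutterstock": (21111, 0),
    "cinemamix-1mp": (7425, 0),
    "anatomy": (16440, 5),
}

# ASSUMPTION: repeats=N means N extra passes over that dataset per epoch.
samples_per_epoch = sum(count * (repeats + 1) for count, repeats in datasets.values())
estimated_epochs = samples_seen / samples_per_epoch

print(f"effective batch size: {effective_batch_size}")
print(f"samples seen: {samples_seen}")
print(f"samples per epoch (approx.): {samples_per_epoch}")
print(f"estimated epochs: {estimated_epochs:.2f}")
```

Under these assumptions the run covers a little more than one full pass over the weighted data, which is in line with the single training epoch reported above.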

Inference

import torch
from diffusers import DiffusionPipeline
from lycoris import create_lycoris_from_weights

model_id = 'stabilityai/stable-diffusion-3.5-medium'
adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
# Load the base pipeline in bf16, matching the training precision.
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# Wrap the transformer with the LyCORIS weights and merge them into it.
lora_scale = 1.0
wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
wrapper.merge_to()

prompt = "A photo-realistic image of a cat"
negative_prompt = 'blurry, cropped, ugly'
# Pick the best available device: CUDA, then Apple MPS, then CPU.
device = 'cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu'
pipeline.to(device)
# Fixed seed for reproducibility (the validation run above used seed 42).
image = pipeline(
    prompt=prompt,
    negative_prompt=negative_prompt,
    num_inference_steps=20,
    generator=torch.Generator(device=device).manual_seed(1641421826),
    width=1024,
    height=1024,
    guidance_scale=3.0,
).images[0]
image.save("output.png", format="PNG")
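
The adapter file has to be downloaded before running the snippet above. One possible way to do that is with `hf_hub_download` from the `huggingface_hub` package, sketched below. The helper name `download_adapter` and the repository id in the usage comment are hypothetical placeholders; substitute the repository that actually hosts this adapter.

```python
def download_adapter(repo_id: str, filename: str = "pytorch_lora_weights.safetensors") -> str:
    """Download the adapter file from the Hugging Face Hub and return its local path."""
    # Imported lazily so this helper can be defined even if
    # huggingface_hub is not installed yet.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=repo_id, filename=filename)


# Usage (HYPOTHETICAL repo id, replace with the real one):
# adapter_id = download_adapter("your-username/sd35m-photo-mixedres-rGN-sS3")
```

The returned path can then be passed as `adapter_id` to `create_lycoris_from_weights`.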