|
--- |
|
license: openrail++ |
|
tags: |
|
- stable-diffusion |
|
- text-to-image |
|
--- |
|
|
|
# SD v2.1-base with Zero Terminal SNR (LAION Aesthetic 6+) |
|
|
|
This model is used in [Diffusion Model with Perceptual Loss](https://arxiv.org/abs/2401.00110) paper as the MSE baseline. |
|
|
|
This model is trained using zero terminal SNR schedule following [Common Diffusion Noise Schedules and Sample Steps are Flawed](https://arxiv.org/abs/2305.08891) paper on LAION aesthetic 6+ data. |
|
|
|
This model is finetuned from [stabilityai/stable-diffusion-2-1-base](https://huggingface.co/stabilityai/stable-diffusion-2-1-base). |
|
|
|
This model is meant for research demonstration, not for production use. |
|
|
|
## Usage |
|
|
|
```python |
|
from diffusers import StableDiffusionPipeline |
|
prompt = "A young girl smiling" |
|
pipe = StableDiffusionPipeline.from_pretrained("ByteDance/sd2.1-base-zsnr-laionaes6").to("cuda") |
|
pipe(prompt, guidance_scale=7.5, guidance_rescale=0.7).images[0].save("out.jpg") |
|
``` |
|
|
|
## Related Models |
|
|
|
* [bytedance/sd2.1-base-zsnr-laionaes5](https://huggingface.co/ByteDance/sd2.1-base-zsnr-laionaes5) |
|
* [bytedance/sd2.1-base-zsnr-laionaes6](https://huggingface.co/ByteDance/sd2.1-base-zsnr-laionaes6) |
|
* [bytedance/sd2.1-base-zsnr-laionaes6-perceptual](https://huggingface.co/ByteDance/sd2.1-base-zsnr-laionaes6-perceptual) |
|
|
|
|
|
## Cite as |
|
``` |
|
@misc{lin2024diffusion, |
|
title={Diffusion Model with Perceptual Loss}, |
|
author={Shanchuan Lin and Xiao Yang}, |
|
year={2024}, |
|
eprint={2401.00110}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.CV} |
|
} |
|
|
|
@misc{lin2023common, |
|
title={Common Diffusion Noise Schedules and Sample Steps are Flawed}, |
|
author={Shanchuan Lin and Bingchen Liu and Jiashi Li and Xiao Yang}, |
|
year={2023}, |
|
eprint={2305.08891}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.CV} |
|
} |
|
``` |