Eleet Model: an anime style stable diffusion model
This model is also available on: Eleet Model - Civitai.
Eleet model is a block-weighted merged stable diffusion model aiming at generating good quality 2D anime style images that satisfy my personal taste and, hopefully, your taste.
ELEET is an Internet slang for the word 'elite', which dates back to 80s/90s. Its modified spelling in a digital form is 31337, which has been the most common Eta Noise Seed Delta (ENSD) value in stable diffusion probably since the NAI era. The model is named in honor of this tradition.
Suggested settings
For users who know little about stable diffusion settings, I recommend:
Prompt: Start with
masterpiece, best quality, aesthetic
. Personally I also like to add photography phrases such ascinematic lighting, professional shadow
, etc.Negative prompt:
(worst quality, low quality:1.4), lowres, bad anatomy, (blurry), (text, logo, watermark, signature, username)
Sampler: DPM++ 2M Karras
CFG Scale: 6~9
Steps: 16~30
Highres Fix (optional): Latent sampler; 0.6~0.7 Denoising strength; 16 Highres steps.
Clip skip: 2
No external VAE needed
Samples
Here are 4 samples of the latest Eleet model version.
Sample 1 (Highres Fix from 512x800):
masterpiece, best quality, aesthetic, 1girl, solo, black eyes, green hair, low-tied long hair, school uniform, sitting, wariza, thighhighs, black thighhighs, from above, looking at viewer, (cityscape), cinematic lighting, professional shadow
Other common settings through all samples:
Negative prompt: (worst quality, low quality:1.4), lowres, bad anatomy, (child, loli), (blurry), (text, logo, watermark, signature, username)
Steps: 20, Sampler: DPM++ 2M Karras, CFG scale: 7, Denoising strength: 0.6, Clip skip: 2, Hires upscale: 1.25, Hires steps: 16, Hires upscaler: Latent
Sample 2 (Highres Fix from 512x800):
masterpiece, best quality, aesthetic, 1girl, solo, red eyes, one eye closed, brown hair, long hair, grin, arms up, crop top, denim shorts, midriff, navel, athletic body, earings, (cowboy shot), looking at viewer, (waterfall), sunny, cinematic lighting, professional shadow
Sample 3 (Highres Fix from 832x512):
masterpiece, best quality, aesthetic, no humans, scenery, sky, cloud, outdoors, mountain, water, sunset, cloudy sky, lake, river, landscape, reflection, tree, nature, mountainous horizon, blue sky, evening, professional shadow, award winner photo
Sample 4 (Highres Fix from 800x512):
masterpiece, best quality, aesthetic, forest, grass, blue sky, cloud, outdoors, no humans, nature, scenery, railroad tracks, sunlight, sunbeam, lens flare, cinematic lighting, professional shadow
Additionally, here are samples of the previous model versions. Click to expand:
Eleet v1.0 samples
Sample 1: Scenery. txt2img+highres, (640x384) x1.5.
masterpiece, best quality, aesthetic, highres RAW photo, landscape photography, wide shot, from below, scenery, sunrise, blue sky, clouds, lake, reflection, sun, trees, floating leaves, ripples, foreground interest, depth of field, cinematic lighting, asymmetric composition, professional shadows, sharp focus, lens flare
Sample 2: Anime girl. txt2img, 576x832.
masterpiece, best quality, aesthetic, 1girl, :d, blue eyes, fox ears, gold hair, twin braids, crop top, off-shoulder jacket, blue pleated skirt, thighhighs, garter belt, skindentation, breasts, bare shoulders, midriff, (cowboy shot), looking at viewer, waterfall, mountains
Sample 3: Scenery. txt2img, 832x576.
masterpiece, best quality, aesthetic, highres RAW photo, cool color tone, wide shot, scenery, snow, blue sky, cityscape, skyline, buildings, rooftop, roads, cinematic lighting, professional shadows, sharp focus
Sample 4: Anime girl. txt2img+highres, (384x640) x1.5.
masterpiece, best quality, 1girl, solo, angry, brown eyes, blue hair, bangs, sidelocks, half updo, clenched hand, parted lips, [bodysuit|armored dress], long sleeves, gloves, armor, looking to the side, clenched teeth, (cowboy shot), night, full moon, forest, underexposure, professional lighting
Merge ideas
The weights for merging Eleet model were optimized through an automatic procedure with scoring, but I didn't necessarily pick the best-scored one as the final version. Instead, I will evaluate a number of high-scored candidates and score their outputs manually by myself.
I conducted the evaluation mostly on anime girls topics (no doubt) but also considered the model performance on scenery images. I will consider:
- Prompt response
- Image aesthetic quality over 3 scenarios:
- txt2img under
--lowvram
mode on a low-end GPU - txt2img under normal or
--medvram
mode on a better GPU - txt2img + highres fix on the previous GPU
- txt2img under
- Image flaws (color shift, illogical drawing, etc.) over the above scenarios
License
CreativeML OpenRAIL-M.