Papers
arxiv:2409.19946

Illustrious: an Open Advanced Illustration Model

Published on Sep 30
· Submitted by solbon1212 on Oct 2
Authors:
,
,
,

Abstract

In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three critical approaches for model improvement. First, we delve into the significance of the batch size and dropout control, which enables faster learning of controllable token based concept activations. Second, we increase the training resolution of images, affecting the accurate depiction of character anatomy in much higher resolution, extending its generation capability over 20MP with proper methods. Finally, we propose the refined multi-level captions, covering all tags and various natural language captions as a critical factor for model development. Through extensive analysis and experiments, Illustrious demonstrates state-of-the-art performance in terms of animation style, outperforming widely-used models in illustration domains, propelling easier customization and personalization with nature of open source. We plan to publicly release updated Illustrious model series sequentially as well as sustainable plans for improvements.

Community

Paper author Paper submitter

Available in CivitAI : https://civitai.com/models/795765?modelVersionId=889818

Recorded 3.2K Downloads with overwhelmingly positive evaluations.

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2409.19946 in a dataset README.md to link it from this page.

Spaces citing this paper 3

Collections including this paper 1