arxiv:2403.07605

Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation

Published on Mar 12

Authors:

Michael Ogezi ,

Abstract

In text-to-image generation, using negative prompts, which describe undesirable image characteristics, can significantly boost image quality. However, producing good negative prompts is manual and tedious. To address this, we propose NegOpt, a novel method for optimizing negative prompt generation toward enhanced image generation, using supervised fine-tuning and reinforcement learning. Our combined approach results in a substantial increase of 25% in Inception Score compared to other approaches and surpasses ground-truth negative prompts from the test set. Furthermore, with NegOpt we can preferentially optimize the metrics most important to us. Finally, we construct Negative Prompts DB, a dataset of negative prompts.

View arXiv page View PDF Add to collection

Community

andupotorac

Jun 3

Hey Michael. Are you guys planning to open source the code and DB so we can take this for a spin?

mikeogezi

Paper author Oct 20

Hi, Andu. I believe we've spoken about the dataset via other channels, but this version should be easier to work with: https://huggingface.co/datasets/mikeogezi/negopt_full.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2403.07605 in a model README.md to link it from this page.

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2403.07605 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.