Textual inversion: Are the imagenet templates fixed?

#84

by xalex - opened Oct 5, 2022

Oct 5, 2022

The imagenet templates for objects all talk about a photo, which may be not ideal to train on drawn objects and other things that are not on photos.

Are the templates fixed (e.g. CLIP expects exactly these strings) or can one just change them or add a few like "a picture of {}", "a drawing of {}" and so on?

carlthome

Feb 28

I'm also wondering this.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment