fp8?

by Joviex - opened 4 days ago

Discussion

Joviex

4 days ago

Are the links on the model card supposed to work?

https://huggingface.co/shuttleai/shuttle-3.1-aesthetic-fp8

This one doesn't point anywhere.

Cheers

xtristan

ShuttleAI org 4 days ago

Hello, we have not fully released the model yet. We are still working on the fp8 and GGUF versions! 😁

xtristan

ShuttleAI org 3 days ago

The fp8 version has been added; GGUF will follow soon.

edtjulien

3 days ago • edited 3 days ago

Hello,

Incredible work! Thanks!

How do I use the fp8 version with Diffusers? Do you have a working example?

fullsoftwares

3 days ago

I am also looking for an FP8 demo with Diffusers. I'm not interested in ComfyUI.

xtristan

ShuttleAI org 3 days ago

Hello @edtjulien @fullsoftwares

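To run fp8 with Diffusers, I usually just do this (`pipe` is an already-loaded pipeline):

from optimum.quanto import freeze, quantize, qint8, qfloat8, qint4

# `pipe` is an already-loaded Diffusers pipeline.
quantize(
    pipe.transformer,
    # weights=qfloat8,  # use this instead of the qint8 line for fp8 weights
    weights=qint8,
    # Leave norm and embedding/projection layers unquantized.
    exclude=[
        "*.norm", "*.norm1", "*.norm2", "*.norm2_context",
        "proj_out", "x_embedder", "norm_out", "context_embedder",
    ],
)
# Replace the float weights with their quantized versions.
freeze(pipe.transformer)
# pipe.enable_model_cpu_offload()  # optional: offload idle components to the CPU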
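
For anyone wondering what the exclude list in the quantize(...) call above actually skips: as far as I understand, quanto compares each pattern against the full dotted module name using shell-style globbing (Python's fnmatch). Here is a minimal sketch of that matching using only the standard library. The module names are hypothetical, made up for illustration, and are not read from the real model.

```python
from fnmatch import fnmatch

# Patterns copied from the quantize(...) call above.
EXCLUDE = [
    "*.norm", "*.norm1", "*.norm2", "*.norm2_context",
    "proj_out", "x_embedder", "norm_out", "context_embedder",
]


def is_excluded(module_name, patterns=EXCLUDE):
    """Return True if any glob pattern matches the full module name."""
    return any(fnmatch(module_name, pattern) for pattern in patterns)


# Hypothetical module names, for illustration only.
names = [
    "x_embedder",
    "proj_out",
    "transformer_blocks.0.norm1",
    "transformer_blocks.0.attn.to_q",
    "single_transformer_blocks.3.norm",
    "single_transformer_blocks.3.proj_out",
]

for name in names:
    status = "excluded" if is_excluded(name) else "eligible for quantization"
    print(f"{name}: {status}")
```

Note that a bare pattern such as "proj_out" has to match the whole name, so under this assumption a nested module whose name merely ends in ".proj_out" would not be excluded. If you want those skipped too, a pattern like "*.proj_out" should do it.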

edtjulien

2 days ago

That works like a charm. Thanks!
