Sayak Paul

sayakpaul

AI & ML interests

Diffusion models, representation learning

Recent Activity

updated a model 28 minutes ago
sayakpaul/FLUX.1-Canny-dev-nf4
updated a model 33 minutes ago
sayakpaul/FLUX.1-Depth-dev-nf4
updated a model 34 minutes ago
sayakpaul/FLUX.1-Fill-dev-nf4

Posts

Post
It's been a while since we shipped native quantization support in diffusers 🧨

We currently support bitsandbytes as the official backend, but using others like torchao is already very simple.
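For instance, switching backends is mostly a matter of swapping the config object. Here is a minimal sketch of the torchao route, assuming torchao is installed and following the pattern from the diffusers quantization docs (the quant type string and model ID are illustrative):

```python
import torch
from diffusers import FluxTransformer2DModel, TorchAoConfig

# int8 weight-only quantization via the torchao backend.
quant_config = TorchAoConfig("int8wo")
transformer = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
)
```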

This post is just a reminder of what's possible (a short sketch follows the docs link below):

1. Loading a model with a quantization config
2. Saving a model with a quantization config
3. Loading a pre-quantized model
4. Using enable_model_cpu_offload() with quantized models
5. Training and loading LoRAs into quantized checkpoints

Docs:
https://huggingface.co/docs/diffusers/main/en/quantization/bitsandbytes
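
A minimal end-to-end sketch of steps 1-5 with the bitsandbytes backend (the save directory and LoRA path are illustrative placeholders; see the docs above for details):

```python
import torch
from diffusers import BitsAndBytesConfig, FluxPipeline, FluxTransformer2DModel

# 1. Load a model with a quantization config (4-bit NF4 here).
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
transformer = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
)

# 2. Save it; the quantization config is serialized with the weights.
transformer.save_pretrained("flux-transformer-nf4")

# 3. A pre-quantized checkpoint loads back without passing a config.
transformer = FluxTransformer2DModel.from_pretrained(
    "flux-transformer-nf4", torch_dtype=torch.bfloat16
)

# 4. Build the pipeline and offload idle components to the CPU.
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    transformer=transformer,
    torch_dtype=torch.bfloat16,
)
pipe.enable_model_cpu_offload()

# 5. LoRAs can be loaded on top of the quantized checkpoint.
pipe.load_lora_weights("path/to/lora")  # placeholder path
```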
Post
Did a little experimentation with resizing pre-trained LoRAs on Flux. I explored two themes:

* Decreasing the rank of a LoRA
* Increasing the rank of a LoRA

The first is helpful for reducing memory requirements when a LoRA has a high rank, while the second is merely an experiment. Another implication of this study is in unifying the ranks of multiple LoRAs, which avoids recompilation when you want to torch.compile() and swap between them.
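
The rank-decrease case can be done with a truncated SVD of the full low-rank update. This is a generic sketch of that idea, not necessarily the exact recipe in the repo; resize_lora_rank is a hypothetical helper, and LoRA alpha scaling is left out for simplicity:

```python
import torch

def resize_lora_rank(A: torch.Tensor, B: torch.Tensor, new_rank: int):
    # A: (rank, in_features) down-projection, B: (out_features, rank) up-projection.
    # The LoRA update is delta_W = B @ A; refactor it at the target rank via SVD.
    delta_w = B @ A  # (out_features, in_features)
    U, S, Vh = torch.linalg.svd(delta_w, full_matrices=False)
    # Keep the top `new_rank` singular directions and split sqrt(S) between factors.
    # Note: if new_rank exceeds the original rank, the extra singular values are
    # ~zero, so "increasing" the rank adds no new information.
    sqrt_s = torch.sqrt(S[:new_rank])
    new_B = U[:, :new_rank] * sqrt_s           # (out_features, new_rank)
    new_A = sqrt_s[:, None] * Vh[:new_rank]    # (new_rank, in_features)
    return new_A, new_B

# Example: shrink a rank-64 pair down to rank 16.
A = torch.randn(64, 3072)
B = torch.randn(3072, 64)
new_A, new_B = resize_lora_rank(A, B, new_rank=16)
```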

Check it out here:
sayakpaul/flux-lora-resizing