metadata
license: mit
datasets:
  - ILSVRC/imagenet-1k
language:
  - en
base_model:
  - haoosz/BiGR
pipeline_tag: image-feature-extraction

This is the official model release for the paper:

BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

To run our code, download the pretrained weights for both the tokenizer (a Binary Autoencoder) and the BiGR model you want to use.
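One way to fetch the weights is sketched below. It assumes the checkpoints are hosted in the Hugging Face repository `haoosz/BiGR` (the identifier listed in this card's metadata) and that the `huggingface_hub` package is installed. The destination directory `pretrained` is a hypothetical name, not one prescribed by the authors.

```python
from pathlib import Path


def download_bigr_weights(repo_id: str = "haoosz/BiGR",
                          local_dir: str = "pretrained") -> Path:
    """Download every file in the model repository and return the local path.

    Assumptions: `repo_id` comes from this card's metadata; `local_dir` is a
    hypothetical destination. Requires `pip install huggingface_hub`.
    """
    # Imported lazily so this module can be imported without the dependency.
    from huggingface_hub import snapshot_download

    # snapshot_download returns the folder that holds the downloaded files.
    path = snapshot_download(repo_id=repo_id, local_dir=local_dir)
    return Path(path)


if __name__ == "__main__":
    print(download_bigr_weights())
```

A full snapshot pulls every file in the repository, which may be many gigabytes given the checkpoint sizes listed in this card. `snapshot_download` also accepts an `allow_patterns` argument to restrict the download to matching files; the file names inside the repository are not stated here, so check them before filtering.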

Binary Autoencoder

We train the Binary Autoencoder (B-AE) by adapting the official code of Binary Latent Diffusion, and we provide pretrained weights for several configurations.

256x256 resolution

| B-AE | Size | Checkpoint |
| --- | --- | --- |
| d24 | 332M | download |
| d32 | 332M | download |

512x512 resolution

| B-AE | Size | Checkpoint |
| --- | --- | --- |
| d32-512 | 315M | download |

BiGR models ✨

We provide pretrained weights for BiGR models in various sizes.

256x256 resolution

| Model | B-AE | Size | Checkpoint |
| --- | --- | --- | --- |
| BiGR-L-d24 | d24 | 1.35G | download |
| BiGR-XL-d24 | d24 | 3.20G | download |
| BiGR-XXL-d24 | d24 | 5.92G | download |
| BiGR-XXL-d32 | d32 | 5.92G | download |

512x512 resolution

| Model | B-AE | Size | Checkpoint |
| --- | --- | --- | --- |
| BiGR-L-d32-res512 | d32-res512 | 1.49G | download |
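Each BiGR model needs its matching B-AE checkpoint, so a small lookup can help when planning a download. The sketch below copies the names and sizes from the tables in this card. It rests on two assumptions: that the `d32-res512` entry in the model table refers to the B-AE listed as `d32-512`, and that `M` and `G` denote decimal megabytes and gigabytes. The helper `required_download_gb` is a hypothetical name and is not part of the BiGR code.

```python
# Sizes in GB, copied from the tables in this card (332M -> 0.332, etc.).
# Assumption: M and G are decimal units.
BAE_SIZE_GB = {"d24": 0.332, "d32": 0.332, "d32-512": 0.315}

# Model name -> (required B-AE, model size in GB).
# Assumption: "d32-res512" in the model table is the B-AE listed as "d32-512".
MODELS = {
    "BiGR-L-d24": ("d24", 1.35),
    "BiGR-XL-d24": ("d24", 3.20),
    "BiGR-XXL-d24": ("d24", 5.92),
    "BiGR-XXL-d32": ("d32", 5.92),
    "BiGR-L-d32-res512": ("d32-512", 1.49),
}


def required_download_gb(model_names):
    """Total size of the chosen models plus each B-AE they need, counted once."""
    baes = {MODELS[name][0] for name in model_names}
    total = sum(MODELS[name][1] for name in model_names)
    total += sum(BAE_SIZE_GB[bae] for bae in baes)
    return round(total, 3)


if __name__ == "__main__":
    # Two d24 models share a single d24 B-AE checkpoint.
    print(required_download_gb(["BiGR-L-d24", "BiGR-XL-d24"]))
```

Using a set for the B-AE names means a shared tokenizer is counted only once, which matches how the files would be stored on disk.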