Text-to-3D
3d
File size: 2,011 Bytes
9ebf9a0
cd703db
 
 
 
 
 
 
 
 
 
9ebf9a0
 
e1db56f
cd703db
 
 
e1db56f
cd703db
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e1db56f
cd703db
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
---
datasets:
- Objaverse
tags:
- 3d
extra_gated_fields:
  Name: text
  Email: text
  Country: text
  Organization or Affiliation: text
  I ALLOW Stability AI to email me about new model releases: checkbox
license: other
---
# `Stable Zero123`

## Model Description

Stable Zero123 is a model for view-conditioned image generation based on [Zero123](https://github.com/cvlab-columbia/zero123). Following alterations in the dataset rendering procedures and model conditioning strategies, our model demonstrates improved performance when compared to the original Zero123 and its subsequent iteration, Zero123-XL.

**Insert nvs comparison image here**

## Usage

**Add threestudio link and usage here**


## Model Details

* **Developed by**: [Stability AI](https://stability.ai/)
* **Model type**: latent diffusion model.
* **Finetuned from model**: [lambdalabs/sd-image-variations-diffusers](https://huggingface.co/lambdalabs/sd-image-variations-diffusers)
* **License**: [StabilityAI Non-Commercial Research Community License](https://huggingface.co/stabilityai/zero123-sai/raw/main/LICENSE). If you want to use this model for your commercial products or purposes, please contact us [here](https://stability.ai/contact) to learn more.

### Training Dataset

We use renders from the [Objaverse](https://objaverse.allenai.org/objaverse-1.0) dataset, utilizing our enhanced rendering method

### Training Infrastructure

* **Hardware**: `Stable Zero123` was trained on the Stability AI cluster on a single node with 8 A100 80GBs GPUs.
* **Code Base**: We use our modified version of [the original zero123 repository](https://github.com/cvlab-columbia/zero123).


### Misuse, Malicious Use, and Out-of-Scope Use

The model should not be used to intentionally create or disseminate images that create hostile or alienating environments for people. This includes generating images that people would foreseeably find disturbing, distressing, or offensive; or content that propagates historical or current stereotypes.