---
license: cc0-1.0
tags:
- art
- stable-diffusion
- text-to-image
- cc0
---
# CC0_rebuild_attempt
**Version Number:** 0.1
## Summary
CC0_rebuild_attempt is a text-to-image model based on the Stable Diffusion 1.5 architecture. It is trained exclusively on CC0 images and other permissive content, aiming to produce high-quality artistic images from given text prompts. The goal is to create a robust and versatile model while ensuring the dataset used is entirely within the public domain, allowing for unrestricted use.
A mixed technique was used to create the image captions: the dataset was segmented by subject, and a different method was used for each segment, such as GIT for realistic and photographic images and CLIP for illustrations. In the end, every caption was corrected by a human.
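
The per-subject captioning flow described above could be organized roughly as in the sketch below. This is only an illustration under stated assumptions, not the pipeline actually used for this model: `git_caption` and `clip_caption` are hypothetical stand-ins for the real GIT and CLIP captioning calls, and the subject labels are invented for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class ImageRecord:
    path: str
    subject: str          # hypothetical segment label, e.g. "photo" or "illustration"
    caption: str = ""
    reviewed: bool = False


# Hypothetical stand-ins: a real pipeline would call the captioning models here.
def git_caption(path: str) -> str:
    return f"[GIT draft] {path}"


def clip_caption(path: str) -> str:
    return f"[CLIP draft] {path}"


# Route each subject segment to the captioning method named in the model card.
CAPTIONERS: Dict[str, Callable[[str], str]] = {
    "realistic": git_caption,
    "photo": git_caption,
    "illustration": clip_caption,
}


def draft_captions(records: List[ImageRecord]) -> List[ImageRecord]:
    """Fill in a machine-generated draft caption for every record."""
    for record in records:
        captioner = CAPTIONERS[record.subject]  # raises KeyError for unknown segments
        record.caption = captioner(record.path)
    return records


def apply_human_review(record: ImageRecord, corrected: str) -> ImageRecord:
    """Replace the draft with the human-corrected caption and mark it reviewed."""
    record.caption = corrected.strip()
    record.reviewed = True
    return record
```

Keeping the `reviewed` flag separate from the caption text makes it easy to check that no machine draft reaches training without a human pass.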
### Training Overview
**Input:** Manually captioned images at 768x
**Output:** Images
**Architecture:** Stable Diffusion 1.5
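
Since the architecture is Stable Diffusion 1.5, inference could look roughly like the sketch below, using the `diffusers` library. Several things here are assumptions rather than facts from this card: the repository id `your-namespace/CC0_rebuild_attempt` is a placeholder, the weights are assumed to be available in `diffusers` format, "768x" is read as 768 by 768 pixels, and the step count and guidance scale are common defaults, not values recommended by the author.

```python
from typing import Any, Dict

# Placeholder repository id (assumption): replace with the model's real location.
MODEL_ID = "your-namespace/CC0_rebuild_attempt"


def generation_kwargs(prompt: str, size: int = 768, steps: int = 30,
                      guidance_scale: float = 7.5) -> Dict[str, Any]:
    """Build pipeline arguments; 768 matches the training resolution (assumed square)."""
    if size % 8 != 0:
        # Stable Diffusion pipelines require dimensions divisible by 8.
        raise ValueError("size must be a multiple of 8")
    return {
        "prompt": prompt,
        "height": size,
        "width": size,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


def generate(prompt: str, model_id: str = MODEL_ID):
    """Load the pipeline and return one PIL image for the prompt."""
    # Heavy imports are kept local so the helper above works without them.
    import torch
    from diffusers import StableDiffusionPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32
    pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=dtype).to(device)
    return pipe(**generation_kwargs(prompt)).images[0]
```

A call such as `generate("a watercolor landscape").save("out.png")` would then write one image to disk, provided the placeholder id is replaced with a real one.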
## Performance Limitations
CC0_rebuild_attempt may face challenges in generating highly detailed or realistic images due to the constraints of the CC0 and permissive content datasets. Additionally, the model may underperform in specific domains where high-quality, diverse CC0 images are less prevalent.
## Training Dataset Limitations
The model is trained on images and content from the following sources:
- **Pexels:** Pexels License
- **LIBRESHOT:** CC0
- **Unsplash:** Lite Dataset License
- **opengameart.org:** CC0
- **Authors:** CC0
- **Contributors:** CC0
- **Met Museum Open Access:** CC0
Dataset sample:
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/OP3ps7Lad71u9fOAEUyAk.jpeg)
Although the dataset consists of CC0 and permissive content, in respect of individual site policies and the creators' preferences, only images that explicitly allow redistribution will be published. The remaining images will be indexed but not redistributed.
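
The publish-or-index rule above can be pictured as a simple partition of the dataset records. The sketch below is a hypothetical illustration, not the project's actual tooling: the `SourceImage` fields and the `allows_redistribution` flag are assumptions, with the flag standing for the result of a manual check of each source's policy.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class SourceImage:
    url: str
    source: str
    license: str
    allows_redistribution: bool  # hypothetical flag set during manual license review


def split_for_release(
    images: Iterable[SourceImage],
) -> Tuple[List[SourceImage], List[SourceImage]]:
    """Return (published, index_only) according to the redistribution flag."""
    published: List[SourceImage] = []
    index_only: List[SourceImage] = []
    for image in images:
        (published if image.allows_redistribution else index_only).append(image)
    return published, index_only


def index_entry(image: SourceImage) -> Dict[str, str]:
    """Metadata recorded for an image that is indexed but not redistributed."""
    return {"url": image.url, "source": image.source, "license": image.license}
```

Images in the `index_only` group contribute only a pointer and license metadata to the released dataset, so the original host remains the place where the file is obtained.
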
These datasets may not cover all possible themes or subjects comprehensively. The dataset may lack representation of certain modern or niche topics due to the limited availability of such content under these licenses. Additionally, the model was trained and developed with a focus on compliance with the Brazilian Copyright Act, which imposes stricter regulations compared to other jurisdictions due to the absence of fair use provisions.
It is important to note that while every effort has been made to ensure the model generates ethical and high-quality content, it is not possible to guarantee that the model will always avoid producing unwanted content or achieve the highest quality in every instance. This project represents an attempt to create an ethical model within the constraints of the available datasets and legal considerations.
Copyright laws differ from country to country, and this project acknowledges the necessity of establishing guidelines for considering public domain content. It is hoped that this research will inspire others to build more responsible models, taking into account the complexities of copyright laws and the ethical use of training data.
Please contact the author before using the model in commercial projects.
## Associated Risks
* The model might struggle with generating highly detailed text within images.
* There may be limitations in creating complex scenes that require deep compositional understanding.
* The quality and diversity of generated images are dependent on the availability and variety of CC0 and permissive content.
* Potential bias towards subjects and styles that are more commonly found in CC0 and permissive content datasets, as well as bias introduced by the manual captioning.
* Please check OpenRAIL and make responsible use of the open framework.
## Intended Uses
* Generative art and design projects
* Educational tools and research in generative AI
* Creative experimentation and artistic expression
* Reference for ethical development
## Showcase
![IMG-20240718-WA0306.jpg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/-ztWvjiM1cEHnpFTd2g1T.jpeg)
![IMG-20240618-WA0149.jpg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/-E1zkaqUK68GZef1cXvi2.jpeg)
![IMG-20240724-WA0115.jpg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/wX1UuYI4iEWQZhbO130Qc.jpeg)
## Authors' Note
I firmly believe that intellectual property laws have often been utilized as a means of controlling and restricting access to content, effectively acting as a form of censorship. When companies use unauthorized content, nothing happens, but for individuals, the consequences are much more severe. Many massive models have used, and continue to use, unauthorized content, and this is unlikely to change. It's a sad reality that has happened before and will keep happening. However, this is the first time these technologies are open source and widely available for everyone. Neither governments nor companies will decide the fate of society, but the work and actions of individual people. We might have to change how we think about what is considered original or creative. At the same time, this shift in perspective is crucial as we navigate the evolving landscape of AI and copyright. I hope this work inspires more responsible and ethical development practices in the field of generative AI.
## Contact
[email protected]