Patil committed on
Commit
3bdcf42
·
verified ·
1 Parent(s): 85efd24

Delete README.md

Files changed (1)
  1. README.md +0 -90
README.md DELETED
@@ -1,90 +0,0 @@
- ---
- license: apache-2.0
- language:
- - zh
- - en
- tags:
- - text-to-image
- - stable-diffusion
- - kolors
- ---
- # Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis
- <div align="center" style="display: flex; justify-content: center; flex-wrap: wrap;">
- <a href="https://github.com/Kwai-Kolors/Kolors"><img src="https://img.shields.io/static/v1?label=Kolors Code&message=Github&color=blue&logo=github-pages"></a> &ensp;
- <a href="https://kwai-kolors.github.io/"><img src="https://img.shields.io/static/v1?label=Team%20Page&message=Page&color=green"></a> &ensp;
- <a href="https://github.com/Kwai-Kolors/Kolors/blob/master/imgs/Kolors_paper.pdf"><img src="https://img.shields.io/static/v1?label=Tech Report&message=Arxiv:Kolors&color=red&logo=arxiv"></a> &ensp;
- <a href="https://kolors.kuaishou.com/"><img src="https://img.shields.io/static/v1?label=Official Website&message=Page&color=green"></a>
- </div>
- <figure>
- <img src="imgs/head_final3.png">
- </figure>
- <br>
-
- ## 📖 Introduction
- Kolors is a large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team. Trained on billions of text-image pairs, Kolors exhibits significant advantages over both open-source and proprietary models in visual quality, complex semantic accuracy, and text rendering for both Chinese and English characters. Furthermore, Kolors supports both Chinese and English inputs, demonstrating strong performance in understanding and generating Chinese-specific content. For more details, please refer to this <a href="https://github.com/Kwai-Kolors/Kolors/blob/master/imgs/Kolors_paper.pdf">technical report</a>.
-
-
- ## 🚀 Quick Start
- ### Requirements
-
- * Python 3.8 or later
- * PyTorch 1.13.1 or later
- * Transformers 4.26.1 or later
- * Recommended: CUDA 11.7 or later
- <br>
-
- 1. Repository cloning and dependency installation
-
- ```bash
- apt-get install git-lfs
- git clone https://github.com/Kwai-Kolors/Kolors
- cd Kolors
- conda create --name kolors python=3.8
- conda activate kolors
- pip install -r requirements.txt
- python3 setup.py install
- ```
- 2. Weights download ([link](https://huggingface.co/Kwai-Kolors/Kolors)):
- ```bash
- huggingface-cli download --resume-download Kwai-Kolors/Kolors --local-dir weights/Kolors
- ```
- or
- ```bash
- git lfs clone https://huggingface.co/Kwai-Kolors/Kolors weights/Kolors
- ```
- 3. Inference:
- ```bash
- python3 scripts/sample.py "一张瓢虫的照片,微距,变焦,高质量,电影,拿着一个牌子,写着“可图”"
- # The image will be saved to "scripts/outputs/sample_test.jpg"
- ```
-
- ### Using with Diffusers
- Please refer to https://huggingface.co/Kwai-Kolors/Kolors-diffusers.
-
- ## 📜 License & Citation
- ### License
- Kolors is fully open-sourced for academic research. For commercial use, please fill out this [questionnaire](https://github.com/Kwai-Kolors/Kolors/blob/master/imgs/可图KOLORS模型商业授权申请书.docx) and send it to [email protected] for registration.
-
- We open-source Kolors to promote the development of large text-to-image models in collaboration with the open-source community. The code of this project is open-sourced under the Apache-2.0 license. We sincerely urge all developers and users to strictly adhere to the [open-source license](MODEL_LICENSE), avoiding the use of the open-source model, code, and its derivatives for any purposes that may harm the country and society or for any services not evaluated and registered for safety. Note that despite our best efforts to ensure the compliance, accuracy, and safety of the data during training, due to the diversity and combinability of generated content and the probabilistic randomness affecting the model, we cannot guarantee the accuracy and safety of the output content, and the model is susceptible to being misled. This project does not assume any legal responsibility for any data security issues, public opinion risks, or risks and liabilities arising from the model being misled, abused, misused, or improperly utilized due to the use of the open-source model and code.
-
-
- ### Citation
- If you find our work helpful, please cite it!
-
- ```
- @article{kolors,
- title={Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis},
- author={Kolors Team},
- journal={arXiv preprint},
- year={2024}
- }
- ```
-
- ### Acknowledgments
- - Thanks to [Diffusers](https://github.com/huggingface/diffusers) for providing the codebase.
- - Thanks to [ChatGLM3](https://github.com/THUDM/ChatGLM3) for providing the powerful Chinese language model.
- <br>
-
- ### Contact Us
-
- If you want to leave a message for our R&D team and product team, feel free to join our [WeChat group](https://github.com/Kwai-Kolors/Kolors/blob/master/imgs/wechat.png). You can also contact us via email ([email protected]).