Update README.md
Browse files
README.md
CHANGED
@@ -61,7 +61,7 @@ Source code is available at https://github.com/NVlabs/Sana.
|
|
61 |
- **Model Description:** This is a model that can be used to generate and modify images based on text prompts.
|
62 |
It is a Linear Diffusion Transformer that uses one fixed, pretrained text encoders ([Gemma2-2B-IT](https://huggingface.co/google/gemma-2-2b-it))
|
63 |
and one 32x spatial-compressed latent feature encoder ([DC-AE](https://hanlab.mit.edu/projects/dc-ae)).
|
64 |
-
- **Resources for more information:** Check out our [GitHub Repository](https://github.com/NVlabs/Sana) and the [
|
65 |
|
66 |
### Model Sources
|
67 |
|
|
|
61 |
- **Model Description:** This is a model that can be used to generate and modify images based on text prompts.
|
62 |
It is a Linear Diffusion Transformer that uses one fixed, pretrained text encoders ([Gemma2-2B-IT](https://huggingface.co/google/gemma-2-2b-it))
|
63 |
and one 32x spatial-compressed latent feature encoder ([DC-AE](https://hanlab.mit.edu/projects/dc-ae)).
|
64 |
+
- **Resources for more information:** Check out our [GitHub Repository](https://github.com/NVlabs/Sana) and the [SANA-1.5 report on arXiv](https://arxiv.org/abs/2501.18427).
|
65 |
|
66 |
### Model Sources
|
67 |
|