---
pipeline_tag: text-to-image
---
Lumina-Image-2.0 is a 2-billion-parameter flow-based diffusion transformer that generates images from text descriptions.

## Usage
```python
import torch
from diffusers import Lumina2Text2ImgPipeline

# Load the pipeline in bfloat16 to reduce memory use.
pipe = Lumina2Text2ImgPipeline.from_pretrained("Alpha-VLLM/Lumina-Image-2.0", torch_dtype=torch.bfloat16)
# Offload idle submodules to the CPU to save VRAM.
# With enough GPU memory, replace this line with pipe.to("cuda").
pipe.enable_model_cpu_offload()

prompt = "A dog holding a sign that says hello world"
image = pipe(
    prompt,
    height=1024,
    width=1024,
    guidance_scale=4.0,
    num_inference_steps=50,
    cfg_trunc_ratio=0.25,
    cfg_normalization=True,
    generator=torch.Generator("cpu").manual_seed(0),  # fixed seed for reproducible output
).images[0]
image.save("lumina_demo.png")
```
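To compare several samples of the same prompt, the call above can be wrapped in a loop over seeds. The following is a minimal sketch, not part of the official API: `seeded_outputs` and `generate_variations` are hypothetical helper names, the `outputs` directory and file stem are assumptions, and `pipe` is assumed to be the pipeline object constructed above.

```python
from pathlib import Path


def seeded_outputs(seeds, out_dir="outputs", stem="lumina_demo"):
    """Pair each seed with a distinct output path (hypothetical helper)."""
    out_dir = Path(out_dir)
    return [(seed, out_dir / f"{stem}_seed{seed}.png") for seed in seeds]


def generate_variations(pipe, prompt, seeds, **call_kwargs):
    """Generate one image per seed and save it (hypothetical helper).

    `pipe` is assumed to be an already loaded pipeline; `call_kwargs` are
    forwarded unchanged, e.g. height, width, guidance_scale.
    """
    import torch  # imported here so the path helper works without torch

    saved = []
    for seed, path in seeded_outputs(seeds):
        path.parent.mkdir(parents=True, exist_ok=True)
        # A fresh CPU generator per seed keeps each sample reproducible.
        generator = torch.Generator("cpu").manual_seed(seed)
        image = pipe(prompt, generator=generator, **call_kwargs).images[0]
        image.save(path)
        saved.append(path)
    return saved
```

With the pipeline loaded, a call such as `generate_variations(pipe, prompt, seeds=[0, 1, 2], height=1024, width=1024)` would write one file per seed, so individual results can be regenerated later from the seed in the filename.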