---
tags:
- text-to-image
- lora
- diffusers
- template:diffusion-lora
widget:
- text: >-
    flat color, no lineart, blending, negative space, artist:[john kafka|ponsuke
    kaikai|hara id 21|yoneyama mai|fuzichoco],  1girl, hoshimachi suisei,
    virtual youtuber, blue hair, side ponytail, cowboy shot, black shirt, star
    print, off shoulder, outdoors, starry sky, wariza, looking up, half-closed
    eyes, black sky,  live2d animation, upper body, high quality cinematic video
    of a woman sitting under the starry night sky. The Camera is steady, This is
    a cowboy shot. The animation is smooth and fluid.
  parameters:
    negative_prompt: >-
      色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走
  output:
    url: images/ComfyUI_00185_.webp
- text: >-
    flat color, no lineart, blending, negative space, artist:[john kafka|ponsuke
    kaikai|hara id 21|yoneyama mai|fuzichoco],  1girl, sakura miko, pink hair,
    cowboy shot, white shirt, floral print, off shoulder, outdoors, cherry
    blossom, tree shade, wariza, looking up, falling petals, half-closed eyes,
    white sky, clouds,  live2d animation, upper body, high quality cinematic
    video of a woman sitting under a sakura tree. The Camera is steady, This is
    a cowboy shot. The animation is smooth and fluid.
  parameters:
    negative_prompt: >-
      色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走
  output:
    url: images/ComfyUI_00177_.webp
base_model: Wan-AI/Wan2.1-T2V-14B
instance_prompt: flat color, no lineart
license: apache-2.0
---
# Flat Color - Style

<Gallery />

## Model description 

A style LoRA for Wan2.1-T2V-14B, trained on images with flat colors, no visible lineart, and little to no indication of depth.

The text-to-video previews were generated with the ComfyUI workflow from [ComfyUI_examples/wan/#text-to-video](https://comfyanonymous.github.io/ComfyUI_examples/wan/#text-to-video), loading the LoRA with the `LoraLoaderModelOnly` node.
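If you drive ComfyUI through an API-format workflow rather than the graph editor, the LoRA loading step might look roughly like the fragment below. This is only a sketch under assumptions: the node IDs, the upstream model-loader node `"1"`, and the file name `wan_flat_color_v2.safetensors` are placeholders, not values taken from this repository.

```py
import json

# Hypothetical fragment of a ComfyUI API-format workflow.
# Node "1" is assumed to be whatever node loads the Wan diffusion model.
lora_node = {
    "2": {
        "class_type": "LoraLoaderModelOnly",
        "inputs": {
            "model": ["1", 0],  # output slot 0 of the assumed loader node "1"
            "lora_name": "wan_flat_color_v2.safetensors",  # placeholder file name
            "strength_model": 1.0,  # LoRA strength applied to the model
        },
    }
}

print(json.dumps(lora_node, indent=2))
```

Downstream sampler nodes would then take their model input from node `"2"` instead of node `"1"`.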

Reprinted from CivitAI by request: https://civitai.com/models/1132089?modelVersionId=1474944

## Trigger words

Include both `flat color` and `no lineart` in your prompt to trigger the style.
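One way to make sure the trigger phrases are always present is to prepend whichever ones a prompt is missing. The snippet below is a minimal sketch; `with_trigger_words` is a hypothetical helper, not part of diffusers or this repository.

```py
TRIGGER_WORDS = ("flat color", "no lineart")


def with_trigger_words(prompt: str, triggers=TRIGGER_WORDS) -> str:
    """Prepend any trigger phrase that the prompt does not already contain."""
    lowered = prompt.lower()
    missing = [t for t in triggers if t not in lowered]
    return ", ".join(missing + [prompt.strip()])


print(with_trigger_words("A cat wandering around new york city"))
# flat color, no lineart, A cat wandering around new york city
```

The check is a plain case-insensitive substring match, so a prompt that already contains both phrases is returned unchanged.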

## Using with Diffusers
```bash
pip install git+https://github.com/huggingface/diffusers.git
```

```py
import torch
from diffusers.utils import export_to_video
from diffusers import AutoencoderKLWan, WanPipeline
from diffusers.schedulers.scheduling_unipc_multistep import UniPCMultistepScheduler

# Available models: Wan-AI/Wan2.1-T2V-14B-Diffusers, Wan-AI/Wan2.1-T2V-1.3B-Diffusers
model_id = "Wan-AI/Wan2.1-T2V-14B-Diffusers"
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
flow_shift = 3.0  # 3.0 for 480P (the resolution used below), 5.0 for 720P
pipe.scheduler = UniPCMultistepScheduler.from_config(pipe.scheduler.config, flow_shift=flow_shift)
pipe.load_lora_weights("motimalu/wan-flat-color-v2")

# Choose one device placement strategy, not both:
# CPU offload for low-VRAM environments, or pipe.to("cuda") if the model fits in VRAM.
pipe.enable_model_cpu_offload()

# Include the trigger words so the LoRA style is applied
prompt = "flat color, no lineart, A cat wandering around new york city"
negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"

output = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    height=480,
    width=720,
    num_frames=81,
    guidance_scale=5.0,
).frames[0]
export_to_video(output, "output.mp4", fps=16)
```
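With `num_frames=81` exported at `fps=16`, the resulting clip runs for about five seconds. The sketch below works through that arithmetic and picks a frame count for a target duration. It assumes the Wan pipeline expects frame counts of the form 4k + 1 (as 81 = 4 * 20 + 1 suggests); treat that constraint as an assumption and check the diffusers documentation. `clip_seconds` and `frames_for_duration` are hypothetical helpers.

```py
def clip_seconds(num_frames: int, fps: int = 16) -> float:
    """Duration in seconds of a clip with num_frames frames played at fps."""
    return num_frames / fps


def frames_for_duration(seconds: float, fps: int = 16) -> int:
    """Nearest frame count of the form 4k + 1 (assumed constraint) for a target duration."""
    k = max(round((seconds * fps - 1) / 4), 0)
    return 4 * k + 1


print(clip_seconds(81))          # 5.0625
print(frames_for_duration(5.0))  # 81
print(frames_for_duration(3.0))  # 49
```

The returned value can be passed as `num_frames` in the pipeline call; longer clips increase memory use and generation time.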

## Download model

Weights for this model are available in Safetensors format.

[Download](/motimalu/wan-flat-color-v2/tree/main) them in the Files & versions tab.