Wan2.1-T2V-1.3B-nf4 / README.md
metadata
license: apache-2.0
base_model: Wan-AI/Wan2.1-T2V-1.3B
language:
  - en
  - zh
pipeline_tag: text-to-video
tags:
  - video
  - video-generation
  - bitsandbytes
  - quantization
  - nf4
library_name: diffusers

An attempt to run Wan2.1-T2V-1.3B with less VRAM.

Changes made:

  • Diffusion Model: Quantized all Linear layers from float32 to nf4, reducing the model size from around 6GB to around 1GB.
  • VAE: Contains no Linear layers, so there is nothing to quantize here.
  • T5-UMT Encoder: A pretty big model that I am having difficulty loading on my poor 4060 (8GB VRAM), and the component that takes the most VRAM. If it can be quantized, the whole pipeline should become much easier to run.
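
As a rough sanity check on the size reduction quoted for the diffusion model, the sketch below estimates how much storage Linear weights need in float32 versus nf4. It is only back-of-the-envelope arithmetic, not the script used to produce this checkpoint. The helper names `fp32_bytes` and `nf4_bytes` are hypothetical, and the block size of 64 with one float32 scale per block (and no double quantization) is an assumption about the quantization settings, which are not stated here.

```python
# Back-of-the-envelope storage estimate for float32 vs. nf4 weights.
# Assumptions (not taken from this model card): weights are quantized in
# blocks of 64, each block stores one float32 scale, and no double
# quantization is applied to the scales.


def fp32_bytes(num_params: int) -> int:
    """Bytes needed to store num_params weights as float32 (4 bytes each)."""
    return num_params * 4


def nf4_bytes(num_params: int, block_size: int = 64) -> int:
    """Bytes needed to store num_params weights as 4-bit codes plus scales."""
    packed = (num_params + 1) // 2             # two 4-bit codes per byte
    num_blocks = -(-num_params // block_size)  # ceiling division
    scales = num_blocks * 4                    # one float32 scale per block
    return packed + scales


if __name__ == "__main__":
    # Nominal "1.3B" parameter count, treated as if every weight were Linear.
    n = 1_300_000_000
    print(f"float32: {fp32_bytes(n) / 1e9:.2f} GB")
    print(f"nf4:     {nf4_bytes(n) / 1e9:.2f} GB")
```

Under these assumptions the estimate comes out to roughly 5.2 GB in float32 and 0.73 GB in nf4. A real checkpoint will likely be somewhat larger, since any non-Linear parameters are not quantized, which is broadly in line with the approximate 6GB to 1GB figures above.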

I will add a quantized T5 Encoder model later if I can get it working.