CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation Paper โข 2501.11325 โข Published 25 days ago โข 4