ODPG: Outfitting Diffusion with Pose Guided Condition
Seohyun Lee, Jintae Park, Sanghyeok Park
2025
Abstract
Virtual Try-On (VTON) technology allows users to visualize how clothes would look on them without physically trying them on, gaining traction with the rise of digitalization and online shopping. Traditional VTON methods, often using Generative Adversarial Networks (GANs) and Diffusion models, face challenges in achieving high realism and handling dynamic poses. This paper introduces Outfitting Diffusion with Pose Guided Condition (ODPG), a novel approach that leverages a latent diffusion model with multiple conditioning inputs during the denoising process. By transforming garment, pose, and appearance images into latent features and integrating these features in a UNet-based denoising model, ODPG achieves non-explicit synthesis of garments on dynamically posed human images. Our experiments on the FashionTryOn and a subset of the DeepFashion dataset demonstrate that ODPG generates realistic VTON images with fine-grained texture details across various poses, utilizing an end-to-end architecture without the need for explicit garment warping processes. Future work will focus on generating VTON outputs in video format and on applying our attention mechanism, as detailed in the Method section, to other domains with limited data.
DownloadPaper Citation
in Harvard Style
Lee S., Park J. and Park S. (2025). ODPG: Outfitting Diffusion with Pose Guided Condition. In Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP; ISBN 978-989-758-728-3, SciTePress, pages 392-402. DOI: 10.5220/0013150600003912
in Bibtex Style
@conference{visapp25,
author={Seohyun Lee and Jintae Park and Sanghyeok Park},
title={ODPG: Outfitting Diffusion with Pose Guided Condition},
booktitle={Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP},
year={2025},
pages={392-402},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013150600003912},
isbn={978-989-758-728-3},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP
TI - ODPG: Outfitting Diffusion with Pose Guided Condition
SN - 978-989-758-728-3
AU - Lee S.
AU - Park J.
AU - Park S.
PY - 2025
SP - 392
EP - 402
DO - 10.5220/0013150600003912
PB - SciTePress