Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Seoul National University
CVPR 2026

TL;DR: Given a human image and one or more garment images, our method generates virtual try-on with human image animation conditioned on a pose video while preserving identity.

Results

Note that all result videos are complete zero-shot inference results that are not included in the training set, without any additional training/optimization.
We present results of transferring attributes to the portrait while simultaneously animating it according to the facial keypoint video.

Upper-body Clothing Transfer

Lower-body Clothing Transfer

Dress Transfer

Hat Transfer

Applications

All result videos are complete zero-shot inferences, not included in the training set and without any additional training/optimization, showing the transfer of multiple attributes simultaneously to the portrait or interpolation between attributes.

Multiple Garment Transfer

In-the-wild Garment Transfer

Garment Interpolation

BibTeX


Coming Soon!