Our framework consists of three stages. (1) Reference Motion Extraction captures motion patterns from early-timestep conditional scores, expressed as gradient fields ∇_z log p(z | y). (2) Motion Transfer with MSG combines content and motion scores through our Mixture of Score Guidance (MSG) formulation, giving precise control over the transferred motion while preserving scene coherence. (3) MSG Path Redirection uses implicit attention-guided dynamics to stabilize motion transfer, steering the diffusion process through modified Langevin dynamics. The approach is zero-shot: it operates directly on pre-trained models without additional training, and it handles scenarios ranging from single-object transformations to complex camera trajectories.
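To make the idea of mixing two scores and following them with Langevin dynamics concrete, here is a minimal toy sketch in one dimension. It is not the MSG formulation itself: the convex weighting in `mix_scores`, the Gaussian stand-ins for the content and motion distributions, and every function and parameter name are assumptions chosen for illustration only.

```python
import math
import random


def gaussian_score(z, mean, sigma=1.0):
    """Score (gradient of the log-density) of a 1-D Gaussian N(mean, sigma^2) at z."""
    return (mean - z) / sigma**2


def mix_scores(content_score, motion_score, weight):
    """Hypothetical convex mixture of two scores; weight is assumed to lie in [0, 1]."""
    return (1.0 - weight) * content_score + weight * motion_score


def langevin_step(z, score, step_size, rng, noise_scale=1.0):
    """One unadjusted Langevin update: drift along the score plus Gaussian noise."""
    noise = rng.gauss(0.0, 1.0)
    return z + step_size * score + noise_scale * math.sqrt(2.0 * step_size) * noise


def run_chain(z0, content_mean, motion_mean, weight,
              steps=200, step_size=0.1, noise_scale=1.0, seed=0):
    """Run a short Langevin chain driven by the mixed toy score."""
    rng = random.Random(seed)
    z = z0
    for _ in range(steps):
        score = mix_scores(
            gaussian_score(z, content_mean),  # stand-in for the content score
            gaussian_score(z, motion_mean),   # stand-in for the motion score
            weight,
        )
        z = langevin_step(z, score, step_size, rng, noise_scale)
    return z


if __name__ == "__main__":
    # With the noise switched off, the chain settles where the mixed score vanishes.
    print(run_chain(0.0, content_mean=0.0, motion_mean=4.0, weight=0.5, noise_scale=0.0))
```

In this toy setting the weight decides where the chain settles between the two Gaussian means, which loosely mirrors the trade-off between preserving content and following the reference motion. A real video diffusion model would replace the Gaussian scores with network predictions over latent tensors.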
@misc{yesiltepe2024motionshop,
title={MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance},
author={Hidir Yesiltepe and Tuna Han Salih Meral and Connor Dunlop and Pinar Yanardag},
year={2024},
eprint={2412.05355},
archivePrefix={arXiv},
primaryClass={cs.CV}
}