MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond

Yupei Lin¹ Xiaoyu Xian² Yukai Shi¹ Liang Lin³

¹ Guangdong University of Technology ² CRRC Academy ³ Sun Yat-sen University

[Paper] | [Code]

Abstract

Recently, text-to-image diffusion models become a new paradigm in image processing fields, including content generation, image restoration and image-to-image translation. Given a target prompt, Denoising Diffusion Probabilistic Models (DDPM) are able to generate realistic yet eligible images. With this appealing property, the image translation task has the potential to be free from target image samples for supervision. By using a target text prompt for domain adaption, the diffusion model is able to implement zero-shot image-to-image translation advantageously. However, the sampling and inversion processes of DDPM are stochastic, and thus the inversion process often fail to reconstruct the input content. Specifically, the displacement effect will gradually accumulated during the diffusion and inversion processes, which led to the reconstructed results deviating from the source domain. To make reconstruction explicit, we propose a prompt redescription strategy to realize a mirror effect between the source and reconstructed image in the diffusion model (MirrorDiffusion). More specifically, a prompt redescription mechanism is investigated to align the text prompts with latent code at each time step of the Denoising Diffusion Implicit Models (DDIM) inversion to pursue a structure-preserving reconstruction. With the revised DDIM inversion, MirrorDiffusion is able to realize accurate zero-shot image translation by editing optimized text prompts and latent code. Extensive experiments demonstrate that MirrorDiffusion achieves superior performance over the state-of-the-art methods on zero-shot image translation benchmarks by clear margins and practical model stability.

Method

Experiment

Dataset

C2D-F and C2G-F: [Google Drive].
S2O-F: [Google Drive].
H2Z-F: [Google Drive].

MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond

Yupei Lin¹ Xiaoyu Xian² Yukai Shi¹ Liang Lin³

¹ Guangdong University of Technology ² CRRC Academy ³ Sun Yat-sen University

Abstract

Method

Experiment

Dataset

Comparisons

More Results

MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond

Yupei Lin1 Xiaoyu Xian2 Yukai Shi1 Liang Lin3

1 Guangdong University of Technology 2 CRRC Academy 3 Sun Yat-sen University

Abstract

Method

Experiment

Dataset

Comparisons

More Results

Yupei Lin¹ Xiaoyu Xian² Yukai Shi¹ Liang Lin³

¹ Guangdong University of Technology ² CRRC Academy ³ Sun Yat-sen University