Official PyTorch implementation of "PAP:Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models"
Cong Wan
Yuhang He
Xiang Song
Yihong Gong
Xi'an Jiaotong University
Abstract: Diffusion models have revolutionized customized text-to-image generation, allowing for efficient synthesis of photos from personal data with textual descriptions. However, these advancements bring forth risks including privacy breaches and unauthorized replication of artworks. Previous researches primarily center around using “prompt-specific methods” to generate adversarial examples to protect personal images, yet the effectiveness of existing methods is hindered by constrained adaptability to different prompts. In this paper, we introduce a Prompt-Agnostic Adversarial Perturbation (PAP) method for customized diffusion models. PAP first models the prompt distribution using a Laplace Approximation, and then produces prompt-agnostic perturbations by maximizing a disturbance expectation based on the modeled distribution. This approach effectively tackles the prompt-agnostic attacks, leading to improved defense stability. Extensive experiments in face privacy and artistic style protection, demonstrate the superior generalization of PAP in comparison to existing techniques.
Details of algorithms and experimental results can be found in our following paper:
@article{wan2024prompt,
title={Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models},
author={Wan, Cong and He, Yuhang and Song, Xiang and Gong, Yihong},
journal={arXiv preprint arXiv:2408.10571},
year={2024},
}
Our paper is currently under review, and the code will be available soon. The project page can be found at: https://vancyland.github.io/PAP.github.io/