arxiv:2403.12585

LASPA: Latent Spatial Alignment for Fast Training-free Single Image Editing

Published on Mar 19, 2024

Abstract

We present a novel, training-free approach for textual editing of real images using diffusion models. Unlike prior methods that rely on computationally expensive finetuning, our approach leverages LAtent SPatial Alignment (LASPA) to efficiently preserve image details. We demonstrate how the diffusion process is amenable to spatial guidance using a reference image, leading to semantically coherent edits. This eliminates the need for complex optimization and costly model finetuning, resulting in significantly faster editing compared to previous methods. Additionally, our method avoids the storage requirements associated with large finetuned models. These advantages make our approach particularly well-suited for editing on mobile devices and applications demanding rapid response times. While simple and fast, our method achieves 62-71% preference in a user study and significantly better model-based editing strength and image preservation scores.
