Continuous Layout Editing of Single Images with Diffusion Models

Zhang, Zhiyuan; Huang, Zhitong; Liao, Jing

dc.contributor.author	Zhang, Zhiyuan	en_US
dc.contributor.author	Huang, Zhitong	en_US
dc.contributor.author	Liao, Jing	en_US
dc.contributor.editor	Chaine, Raphaëlle	en_US
dc.contributor.editor	Deng, Zhigang	en_US
dc.contributor.editor	Kim, Min H.	en_US
dc.date.accessioned	2023-10-09T07:36:13Z
dc.date.available	2023-10-09T07:36:13Z
dc.date.issued	2023
dc.description.abstract	Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose the first framework for layout editing of a single image while preserving its visual properties, thus allowing for continuous editing on a single image. Our approach is achieved through two key modules. First, to preserve the characteristics of multiple objects within an image, we disentangle the concepts of different objects and embed them into separate textual tokens using a novel method called masked textual inversion. Next, we propose a training-free optimization method to perform layout control for a pre-trained diffusion model, which allows us to regenerate images with learned concepts and align them with user-specified layouts. As the first framework to edit the layout of existing images, we demonstrate that our method is effective and outperforms other baselines that were modified to support this task. Code is available at our project page.	en_US
dc.description.number	7
dc.description.sectionheaders	Image Editing and Color
dc.description.seriesinformation	Computer Graphics Forum
dc.description.volume	42
dc.identifier.doi	10.1111/cgf.14966
dc.identifier.issn	1467-8659
dc.identifier.pages	11 pages
dc.identifier.uri	https://doi.org/10.1111/cgf.14966
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14966
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Computing methodologies -> Image manipulation; Graphics systems and interfaces; Neural networks
dc.subject	Computing methodologies
dc.subject	Image manipulation
dc.subject	Graphics systems and interfaces
dc.subject	Neural networks
dc.title	Continuous Layout Editing of Single Images with Diffusion Models	en_US

Files

Original bundle

Name: v42i7_38_14966.pdf
Size: 31.31 MB
Format: Adobe Portable Document Format

Name: paper1105_mm.pdf
Size: 44.38 MB
Format: Adobe Portable Document Format

Collections

42-Issue 7