Papers
arxiv:2502.01639

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Published on Feb 3
ยท Submitted by RohitGandikota on Feb 4
Authors:
,
,

Abstract

We present SliderSpace, a framework for automatically decomposing the visual capabilities of diffusion models into controllable and human-understandable directions. Unlike existing control methods that require a user to specify attributes for each edit direction individually, SliderSpace discovers multiple interpretable and diverse directions simultaneously from a single text prompt. Each direction is trained as a low-rank adaptor, enabling compositional control and the discovery of surprising possibilities in the model's latent space. Through extensive experiments on state-of-the-art diffusion models, we demonstrate SliderSpace's effectiveness across three applications: concept decomposition, artistic style exploration, and diversity enhancement. Our quantitative evaluation shows that SliderSpace-discovered directions decompose the visual structure of model's knowledge effectively, offering insights into the latent capabilities encoded within diffusion models. User studies further validate that our method produces more diverse and useful variations compared to baselines. Our code, data and trained weights are available at https://sliderspace.baulab.info

Community

Paper author Paper submitter
โ€ข
edited about 11 hours ago

We propose an unsupervised method to discover creative slider directions by unlocking the model's knowledge about a concept.

All you need to provide is a single prompt "toy" and you can discover 100s of sliders

image.png

These are the directions that diffusion model thinks is creative and interesting about a "toy".

Now you can also use this for exploration of model's knowledge. For instance here are some directions of art styles from SDXL that we explored using a single prompt "art in the style of a famous artist"

image.png

You can discover directions in any model using SliderSpace - SDv1.4, SDXL, SDXL-Turbo (DMD, Lightning, LCM), FLUX, Pix-art, .... (many more). It's time to unlock the model's creativity and explore!

This is awesome!

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2502.01639 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2502.01639 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2502.01639 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.