Abstract
Diffusion models have made significant breakthroughs in image, audio, and video generation, but they depend on an iterative generation process that causes slow sampling speed and caps their potential for real-time applications. To overcome this limitation, we propose consistency models, a new family of generative models that achieve high sample quality without adversarial training. They support fast one-step generation by design, while still allowing for few-step sampling to trade compute for sample quality. They also support zero-shot data editing, like image inpainting, colorization, and super-resolution, without requiring explicit training on these tasks. Consistency models can be trained either as a way to distill pre-trained diffusion models, or as standalone generative models. Through extensive experiments, we demonstrate that they outperform existing distillation techniques for diffusion models in one- and few-step generation. For example, we achieve the new state-of-the-art FID of 3.55 on CIFAR-10 and 6.20 on ImageNet 64x64 for one-step generation. When trained as standalone generative models, consistency models also outperform single-step, non-adversarial generative models on standard benchmarks like CIFAR-10, ImageNet 64x64 and LSUN 256x256.
Community
Pretty cool approach!
Currently known open replications:
- https://github.com/cloneofsimo/consistency_models
- https://twitter.com/RiversHaveWings/status/1634038603247661062
Does anyone know of any other one?
I have implemented a library for training consistency models. https://github.com/Kinyugo/consistency_models
OpenAI open sourced their implementation as well! https://github.com/openai/consistency_models
Model is being merged to Diffusers as well and will be in the next 0.18.0 release!
https://github.com/huggingface/diffusers/pull/3492
Unleashing Speed: Consistency Models for Fast Generative AI
Links ๐:
๐ Subscribe: https://www.youtube.com/@Arxflix
๐ Twitter: https://x.com/arxflix
๐ LMNT (Partner): https://lmnt.com/
Models citing this paper 20
Browse 20 models citing this paperDatasets citing this paper 0
No dataset linking this paper