MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data Paper โข 2406.18790 โข Published Jun 26 โข 33
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model Paper โข 2404.09967 โข Published Apr 15 โข 20
Controllable Music Production with Diffusion Models and Guidance Gradients Paper โข 2311.00613 โข Published Nov 1, 2023 โข 25
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Paper โข 2306.07954 โข Published Jun 13, 2023 โข 112