Shot categorizer Collection Fine-tune of Florence-2 to generate shot categories, useful for data curation. Code: https://github.com/huggingface/movie-shot-categorizer. • 3 items • Updated 3 days ago • 2
video-effects datasets Collection Smol datasets to emulate cool video effects like "squish", "dissolve", etc. Inspired by Pika effects. • 4 items • Updated Jan 28 • 2
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget Paper • 2407.15811 • Published Jul 22, 2024 • 2
AI Tools for Art - Feb '25 Collection Tools & models from the 2nd issue of AI Tools for Art 🎉 Read more about February's releases: https://open.substack.com/pub/multimodalaiart • 18 items • Updated 9 days ago • 1
Remote VAE Inference Endpoints Collection Models and handler code used in https://huggingface.co./blog/remote_vae • 4 items • Updated 12 days ago • 4
AnyText2: Visual Text Generation and Editing With Customizable Attributes Paper • 2411.15245 • Published Nov 22, 2024 • 1
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 17 days ago • 128
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published Jan 16 • 70
video-effects Collection Fine-tunes of open video generation models like CogVideoX to emulate cool video effects like "squish", "dissolve", "cakeify", etc. Pika inspired. • 8 items • Updated 1 day ago • 5
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets Paper • 2311.15127 • Published Nov 25, 2023 • 13