VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper • 2411.13503 • Published Nov 20 • 30
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper • 2411.06176 • Published Nov 9 • 44
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22 • 89
VideoBooth: Diffusion-based Video Generation with Image Prompts Paper • 2312.00777 • Published Dec 1, 2023 • 21
FreeInit: Bridging Initialization Gap in Video Diffusion Models Paper • 2312.07537 • Published Dec 12, 2023 • 25
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Paper • 2309.15103 • Published Sep 26, 2023 • 42