MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections Paper • 2502.12170 • Published 24 days ago • 12
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published 24 days ago • 34
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling Paper • 2502.09509 • Published 24 days ago • 7
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening Paper • 2502.12146 • Published 20 days ago • 16
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models Paper • 2502.10458 • Published 25 days ago • 30
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 21 days ago • 141
MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers Paper • 2502.07856 • Published 26 days ago • 4
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 23 days ago • 51
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer Paper • 2502.05979 • Published 28 days ago • 8
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation Paper • 2502.08690 • Published 25 days ago • 41
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE Paper • 2502.06282 • Published 27 days ago • 5
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile Paper • 2502.06155 • Published 27 days ago • 9
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers Paper • 2502.06527 • Published 27 days ago • 10
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Paper • 2502.05179 • Published 30 days ago • 24
VideoRoPE: What Makes for Good Video Rotary Position Embedding? Paper • 2502.05173 • Published 30 days ago • 64
Goku: Flow Based Video Generative Foundation Models Paper • 2502.04896 • Published about 1 month ago • 95