Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published 3 days ago • 44
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published 12 days ago • 32
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation Paper • 2502.08639 • Published 25 days ago • 37
CGB-DM: Content and Graphic Balance Layout Generation with Transformer-based Diffusion Model Paper • 2407.15233 • Published Jul 21, 2024 • 6