GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 4 days ago • 18 • 4
DynVFX: Augmenting Real Videos with Dynamic Content Paper • 2502.03621 • Published Feb 5 • 29 • 3
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing Paper • 2402.15151 • Published Feb 23, 2024 • 7 • 2
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages Paper • 2303.12582 • Published Mar 22, 2023 • 20 • 3