-
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper • 2409.18124 • Published • 32 -
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Paper • 2409.18125 • Published • 33 -
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Paper • 2410.11795 • Published • 16 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 87
Xuejian Rong
xrong
AI & ML interests
None yet
Recent Activity
liked
a model
29 days ago
HuggingFaceTB/SmolVLM-Instruct-DPO
upvoted
a
collection
29 days ago
SmolVLM
Organizations
None yet
Collections
11
models
None public yet
datasets
None public yet