arxiv:2412.14171
Jihan Yang
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
liked
a dataset
5 days ago
nyu-visionx/VSI-Bench
authored
a paper
5 days ago
Thinking in Space: How Multimodal Large Language Models See, Remember,
and Recall Spaces