arxiv:2501.05452
Zhengyuan Yang
zyang39
AI & ML interests
None yet
Recent Activity
authored
a paper
about 3 hours ago
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding
authored
a paper
28 days ago
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary
Embedding Distillation
upvoted
a
paper
about 1 month ago
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary
Embedding Distillation
Organizations
Papers
17
models
None public yet
datasets
None public yet