One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
upvoted a paper 6 days ago
Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
new activity 6 days ago
showlab/ShowUI-web: Missing Images