20 59 26

HAODONG DUAN

KennyUTC

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

Recent Activity

upvoted a paper 4 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

authored a paper 8 days ago

MM-IFEngine: Towards Multimodal Instruction Following

upvoted a paper 8 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

View all activity

Organizations

KennyUTC's activity

upvoted a paper 4 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 4 days ago • 223

authored a paper 8 days ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published 8 days ago • 31

upvoted 2 papers 8 days ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published 8 days ago • 43

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published 8 days ago • 31

upvoted a paper 10 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 11 days ago • 161

updated a dataset 11 days ago

VLMEval/OpenVLMRecords

Updated 11 days ago • 764 • 6

authored a paper 15 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 15 days ago • 67

upvoted a paper 15 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 15 days ago • 67

commented a paper 15 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 15 days ago • 67 •

authored a paper 23 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 24 days ago • 33

upvoted a paper 23 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 24 days ago • 33

commented a paper 23 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 24 days ago • 33 •

liked 3 datasets 24 days ago

authored a paper about 1 month ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 44

upvoted 2 papers about 1 month ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 44

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 34

authored a paper about 1 month ago

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 34

upvoted a paper about 1 month ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 118