Sherman Siu's picture

10 8 1

Sherman Siu

shermansiu

·

shermansiu

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

upvoted a paper 16 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

new activity 26 days ago

shermansiu/dm_graphcast:Can the evaluation data in the paper be reproduced?

View all activity

Organizations

None yet

shermansiu's activity

upvoted 2 papers 16 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 19 days ago • 121

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 19 days ago • 46

upvoted a paper about 1 month ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11 • 45

upvoted a paper 2 months ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14 • 38

upvoted a paper 6 months ago

MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Paper • 2406.15252 • Published Jun 21 • 14

upvoted 3 papers 8 months ago

XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

Paper • 2404.15420 • Published Apr 23 • 7

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Paper • 2404.16022 • Published Apr 24 • 21

Editable Image Elements for Controllable Synthesis

Paper • 2404.16029 • Published Apr 24 • 10