10 6 4

Haobo Yuan

HarborYuan

https://yuanhaobo.me

AI & ML interests

computer vision

Recent Activity

new activity 7 days ago

ByteDance/Sa2VA-1B:ValueError due to Mismatch in Tensor Shapes when Loading Model

updated a dataset 7 days ago

Dense-World/Sa2VA-Training

liked a dataset 9 days ago

Dense-World/Sa2VA-Training

View all activity

Organizations

HarborYuan's activity

New activity in ByteDance/Sa2VA-1B 7 days ago

ValueError due to Mismatch in Tensor Shapes when Loading Model

#3 opened 12 days ago by

Nikuson

updated a dataset 7 days ago

Dense-World/Sa2VA-Training

Updated 7 days ago • 67 • 3

liked a dataset 9 days ago

Dense-World/Sa2VA-Training

Updated 7 days ago • 67 • 3

updated 2 models 10 days ago

Dense-World/Sa2VA-26B

Updated 10 days ago • 4

Dense-World/Sa2VA-1B

Updated 10 days ago • 4

published 2 models 10 days ago

Dense-World/Sa2VA-1B

Updated 10 days ago • 4

Dense-World/Sa2VA-26B

Updated 10 days ago • 4

updated a dataset 11 days ago

HarborYuan/omgseg_data

Updated 11 days ago • 16 • 1

New activity in ByteDance/Sa2VA-4B 13 days ago

Issue when running inference with the 4B model

#3 opened 14 days ago by

armandal

updated a dataset 16 days ago

HarborYuan/vid_ref_seg_benchmark

Preview • Updated 16 days ago • 84

authored 2 papers 19 days ago

LLAVADI: What Matters For Multimodal Large Language Models Distillation

Paper • 2407.19409 • Published Jul 28, 2024

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 20 days ago • 42

upvoted a collection 19 days ago

Sa2VA model zoo

Collection

4 items • Updated 13 days ago • 28

upvoted a paper 19 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 20 days ago • 42

liked a model 20 days ago

ByteDance/Sa2VA-4B

Image-Text-to-Text • Updated 13 days ago • 3.55k • 58

upvoted a paper about 2 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 45

updated a model about 2 months ago

Dense-World/Sa2VA-4B

Image-Text-to-Text • Updated 21 days ago • 19

updated a dataset 3 months ago

Dense-World/video-res

Viewer • Updated Nov 4, 2024 • 2.47k • 18

updated a model 4 months ago

HarborYuan/ovsam_models

Mask Generation • Updated Sep 30, 2024 • 3

New activity in HarborYuan/ovsam_models 4 months ago

Add model card

#1 opened 4 months ago by

nielsr