Shijia Yang

shijiay

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

authored a paper about 1 month ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

commented on a paper 4 months ago

Law of Vision Representation in MLLMs

Organizations

None yet

Papers 4

arxiv:2412.02611

arxiv:2408.16357

arxiv:2310.01779

arxiv:2211.11720

Models 27

shijiay/llava_clip224_stage1

Image-Text-to-Text • Updated Sep 3, 2024 • 18

shijiay/llava_clip224_stage2

Image-Text-to-Text • Updated Sep 3, 2024 • 25

shijiay/llava_dinov2_stage2

Image-Text-to-Text • Updated Sep 3, 2024 • 18 • 1

shijiay/llava_clip_stage1

Image-Text-to-Text • Updated Sep 3, 2024 • 15

shijiay/llava_clip_stage2

Image-Text-to-Text • Updated Sep 3, 2024 • 43

shijiay/llava_openclip_stage1

Image-Text-to-Text • Updated Sep 3, 2024 • 9

shijiay/llava_openclip_stage2

Image-Text-to-Text • Updated Sep 3, 2024 • 11

shijiay/llava_siglip_stage1

Image-Text-to-Text • Updated Sep 3, 2024 • 15

shijiay/llava_siglip_stage2

Image-Text-to-Text • Updated Sep 3, 2024 • 23

shijiay/llava_sdim_stage1

Image-Text-to-Text • Updated Sep 3, 2024 • 7

Datasets

None public yet