30 35 53

Lin Chen

Lin-Chen

https://lin-chen.site

AI & ML interests

None yet

Recent Activity

authored a paper 9 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

liked a model 13 days ago

internlm/internlm-xcomposer2d5-ol-7b

upvoted a paper 13 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

View all activity

Organizations

Lin-Chen's activity

authored a paper 9 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 13 days ago • 90

liked a model 13 days ago

internlm/internlm-xcomposer2d5-ol-7b

Visual Question Answering • Updated 13 days ago • 41

upvoted a paper 13 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 13 days ago • 90

liked a dataset 16 days ago

Tongyi-ConvAI/MMEvol

Preview • Updated 26 days ago • 1.45k • 8

authored a paper 23 days ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 28 days ago • 32

upvoted a paper 23 days ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 28 days ago • 32

liked a Space 23 days ago

Running

270

⚡

Qwen2.5 72B Instruct

upvoted a collection about 1 month ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated 20 days ago • 181

updated a dataset 2 months ago

Lin-Chen/Open-LLaVA-NeXT-mix1M

Updated Oct 25 • 90 • 11

upvoted 3 papers 2 months ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23 • 34

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21 • 65

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published Oct 22 • 45

liked 6 datasets 2 months ago

upvoted a paper 3 months ago

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

Paper • 2410.07167 • Published Oct 9 • 37

liked a dataset 3 months ago

AIDC-AI/Ovis-dataset

Preview • Updated Sep 16 • 1.01k • 23