ShareGPTVideo

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

ruohongz updated a dataset 11 days ago

ShareGPTVideo/train_video_and_instruction

ruohongz updated a dataset about 2 months ago

ShareGPTVideo/train_raw_video

ruohongz updated a model about 2 months ago

ShareGPTVideo/LLaVA-Hound-DPO

View all activity

ShareGPTVideo's activity

ruohongz

updated a dataset 11 days ago

ShareGPTVideo/train_video_and_instruction

Updated 11 days ago • 1.46k • 20

ruohongz

updated a dataset about 2 months ago

ShareGPTVideo/train_raw_video

Viewer • Updated Oct 31 • 64.1k • 141 • 1

ruohongz

updated 3 models about 2 months ago

ruohongz

authored 4 papers 2 months ago

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21 • 22

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Paper • 2310.11667 • Published Oct 18, 2023 • 2

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

Paper • 2311.10614 • Published Nov 17, 2023

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Paper • 2404.01258 • Published Apr 1 • 10

ZhangYuanhan

authored a paper 3 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3 • 38

ZhangYuanhan

authored 9 papers 5 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

FunQA: Towards Surprising Video Comprehension

Paper • 2306.14899 • Published Jun 26, 2023 • 1

MMBench: Is Your Multi-modal Model an All-around Player?

Paper • 2307.06281 • Published Jul 12, 2023 • 5

Neural Prompt Search

Paper • 2206.04673 • Published Jun 9, 2022

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 7

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Paper • 2404.01258 • Published Apr 1 • 10

Learning without Forgetting for Vision-Language Models

Paper • 2305.19270 • Published May 30, 2023

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

Paper • 2203.07845 • Published Mar 15, 2022

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17 • 33

ZhangYuanhan

authored a paper 6 months ago

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10 • 40

AI & ML interests

Recent Activity

Team members 4

ShareGPTVideo's activity