Model Card for VideoChat2-HD-hf

This modelcard aims to give the model info of 'MVBench: A Comprehensive Multi-modal Video Understanding Benchmark'.

Model Details

🛠Usage

Check the Jupyter Demo.

📃Model Sources

✏️Citation

If you find this work useful for your research, please consider citing VideoChat2. Your acknowledgement would greatly help us in continuing to contribute resources to the research community.

@article{li2023videochat,
  title={VideoChat: Chat-Centric Video Understanding},
  author={KunChang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, and Yu Qiao},
  journal={arXiv preprint arXiv:2305.06355},
  year={2023}
}

@misc{li2023mvbench,
      title={MVBench: A Comprehensive Multi-modal Video Understanding Benchmark}, 
      author={Kunchang Li and Yali Wang and Yinan He and Yizhuo Li and Yi Wang and Yi Liu and Zun Wang and Jilan Xu and Guo Chen and Ping Luo and Limin Wang and Yu Qiao},
      year={2023},
      eprint={2311.17005},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
Downloads last month
1,445
Safetensors
Model size
7.72B params
Tensor type
I64
·
F32
·
FP16
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for OpenGVLab/VideoChat2_HD_stage4_Mistral_7B_hf

Finetuned
(375)
this model

Collection including OpenGVLab/VideoChat2_HD_stage4_Mistral_7B_hf