ColPhi3.5

This model was trained from scratch on the data_dir/colpali_train_set dataset.

Model description

ColPhi3.5 is a model based on a novel model architecture and training strategy based on Vision Language Models (VLMs) to efficiently index documents from their visual features. It is a Phi3.5-V-4B extension that generates ColBERT- style multi-vector representations of text and images. It was introduced in the paper ColPali: Efficient Document Retrieval with Vision Language Models.

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

yydxlv
/

colphi3.5

ColPhi3.5

Model description

Intended uses & limitations

Training and evaluation data

Model tree for yydxlv/colphi3.5

Dataset used to train yydxlv/colphi3.5

Evaluation results