Update README.md
Browse files
README.md
CHANGED
@@ -28,6 +28,15 @@ Our model is being trained on MMEB-train and evaluated on MMEB-eval with contras
|
|
28 |
## Performance
|
29 |
This model outperforms the baselines and previous version of VLM2Vec by a large margin.
|
30 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
31 |

|
32 |
|
33 |
|
|
|
28 |
## Performance
|
29 |
This model outperforms the baselines and previous version of VLM2Vec by a large margin.
|
30 |
|
31 |
+
| Model | Classification | VQA | Retrieval | Grounding | IND | OOD | Overall |
|
32 |
+
|---------------------------------------|---------------|------|-----------|-----------|------|------|---------|
|
33 |
+
| Phi-3.5-V, Full-model fine-tuned (#crop=4) | 52.8 | 50.3 | 57.8 | 72.3 | 62.8 | 47.4 | 55.9 |
|
34 |
+
| Phi-3.5-V, LoRA | 54.8 | 54.9 | 62.3 | 79.5 | 66.5 | 52.0 | 60.1 |
|
35 |
+
| LLaVA-1.6, LoRA | 54.7 | 50.3 | 56.2 | 64.0 | 61.0 | 47.5 | 55.0 |
|
36 |
+
| LLaVA-1.6, LoRA | 61.2 | 49.9 | 67.4 | 86.1 | 67.5 | 57.1 | 62.9 |
|
37 |
+
| Qwen2-VL-2B, LoRA | 59.0 | 49.4 | 65.4 | 73.4 | 66.0 | 52.6 | 60.1 |
|
38 |
+
| **Qwen2-VL-7B, LoRA (this model)** | **62.6** | **57.8** | **69.9** | 81.7 | **72.2** | **57.8** | **65.8** |
|
39 |
+
|
40 |

|
41 |
|
42 |
|