Towards the Aha Moment of Vision-Language Models
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
Multi-modal Multilingual Instruction
university
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
1
models
9
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/Qwen2-VL-72B-Video-T3
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/Giraffe
Updated
•
12
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/LongVA-7B-Video-T3
Updated
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/Qwen-VL-ArXivCap
Text Generation
•
Updated
•
21
•
4
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/Qwen-VL-ArXivQA
Text Generation
•
Updated
•
27
•
4
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/Silkie
Text Generation
•
Updated
•
37
•
12
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/YingVLM
Updated
•
19
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/YingVLM-zh
Updated
•
6
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/MV5fZ-34eISj5owCf11W1.png)
MMInstruction/YingVLM-Video
Updated
•
8
datasets
14
MMInstruction/Clevr_CoGenT_TrainA_R1
Viewer
•
Updated
•
37.8k
•
907
•
27
MMInstruction/SuperClevr_Val
Viewer
•
Updated
•
5k
•
48
MMInstruction/Clevr_CoGenT_TrainA_70K_Complex
Viewer
•
Updated
•
70k
•
53
MMInstruction/Clevr_CoGenT_ValB
Viewer
•
Updated
•
5k
•
81
•
1
MMInstruction/Clevr_CoGenT_ValA
Viewer
•
Updated
•
5k
•
68
MMInstruction/Clevr_CoAgent_TrainA_R1
Viewer
•
Updated
•
2.5k
•
41
MMInstruction/VL-RewardBench
Viewer
•
Updated
•
1.25k
•
542
•
5
MMInstruction/RedTeamingVLM
Updated
•
2.1k
•
14
MMInstruction/VLFeedback
Viewer
•
Updated
•
80.3k
•
561
•
45
MMInstruction/ArxivCap
Viewer
•
Updated
•
573k
•
3.61k
•
50