This is the repository for the FaDianInternVL2-2B model, which is fine-tuned on the CLoT-Oogiri-GO dataset.
We separated out all of the I2T (image-to-text) data and filtered it, which left a final dataset of more than 24,000 samples. From that we split off 3,000 images and fine-tuned the model on them with QLoRA.
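The filter-then-subsample step described above might look roughly like the sketch below. The record fields (`task`, `image`, `text`), the filter criteria, and the fixed seed are all assumptions made for illustration; the actual filtering rules used for this model are not documented here.

```python
import random


def filter_i2t(records):
    """Keep image-to-text records that have an image and a non-empty text.

    The field names and the criteria are hypothetical.
    """
    return [
        r for r in records
        if r.get("task") == "I2T"
        and r.get("image")
        and (r.get("text") or "").strip()
    ]


def subsample(records, k=3000, seed=0):
    """Draw k records without replacement, reproducibly (seed is an assumption)."""
    if len(records) <= k:
        return list(records)
    return random.Random(seed).sample(records, k)


# Toy stand-in data; the real dataset is CLoT-Oogiri-GO.
records = [
    {"task": "I2T", "image": f"img_{i}.jpg", "text": f"caption {i}"}
    for i in range(5000)
]
records.append({"task": "T2T", "image": None, "text": "text only"})  # wrong task
records.append({"task": "I2T", "image": "x.jpg", "text": "  "})      # empty text

kept = filter_i2t(records)
subset = subsample(kept)
print(len(kept), len(subset))
```

Fixing the seed keeps the 3,000-image subset identical across runs, which makes a fine-tuning run easier to reproduce.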
first
second