Lin-Chen/ShareGPT4V-13B · Question on the two variants

Dec 21, 2023

You provided two different fine tunes for 13B, both are very differently named (one pretrained vicuna, and this one)
The readme lists less training data for the vicuna upload, the configuration shows "square" as aspect ratio, does that mean the image is cropped to square instead of padding in this case ?

What is the purpose of the other finetune ?
I'm just testing both, the vicuna one appears to hallucinate a tiny bit more than the previous 7B variant and it does not follow instructions as good as I was used to.

Lin-Chen

Owner Dec 21, 2023

You provided two different fine tunes for 13B, both are very differently named (one pretrained vicuna, and this one)
The readme lists less training data for the vicuna upload, the configuration shows "square" as aspect ratio, does that mean the image is cropped to square instead of padding in this case ?

What is the purpose of the other finetune ?
I'm just testing both, the vicuna one appears to hallucinate a tiny bit more than the previous 7B variant and it does not follow instructions as good as I was used to.

Sorry for the confusion. The vicuna one is LLM only after the pretain stage.

cmp-nct

Dec 21, 2023

Thanks a lot, that explains it.
I tested the other one and it does not have the issues anymore and is better in listing fine details than the 7B one.
However - hallucinations increased also

Lin-Chen

Owner Dec 22, 2023

Thanks a lot, that explains it.
I tested the other one and it does not have the issues anymore and is better in listing fine details than the 7B one.
However - hallucinations increased also

Thanks for your feedback! We will make some efforts to decrease the hallucinations in future work.

Lin-Chen changed discussion status to closed Dec 27, 2023