MMStar test score is lower than expected: 0.47

#93

by DUNDUN2 - opened 1 day ago

Hi there, I've been using VLMEvalKit to evaluate the model after post-training. All other benchmarks look fine, but the MMStar score is low (0.47). Could you shed some light on why this might be the case?
