Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Sep 26 • 270
LLaVa-1.5 Collection LLaVa-1.5 is a series of vision-language models (VLMs) trained on a variety of visual instruction datasets. • 3 items • Updated Mar 18 • 7