--- license: apache-2.0 ---

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection