mlx-community/llava-interleave-qwen-0.5b-8bit

This model was converted to MLX format from llava-hf/llava-interleave-qwen-0.5b-hf using mlx-vlm version 0.0.15. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/llava-interleave-qwen-0.5b-8bit --max-tokens 100 --temp 0.0
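The command above runs the model with its default inputs. As a minimal sketch of supplying your own image and prompt, the script below builds the same command with two extra flags. It assumes the installed mlx-vlm CLI accepts `--image` and `--prompt`; check `python -m mlx_vlm.generate --help` for your version. The image path and prompt text are hypothetical placeholders. The script only prints the command by default, and runs it when `RUN=1` is set.

```shell
# Hypothetical inputs: replace with your own image path or URL and prompt.
MODEL="mlx-community/llava-interleave-qwen-0.5b-8bit"
IMAGE="path/to/image.jpg"
PROMPT="Describe this image."

# Build the command as positional parameters so quoting is preserved.
# --image and --prompt are assumed flags; verify them with --help.
set -- python -m mlx_vlm.generate \
  --model "$MODEL" \
  --max-tokens 100 \
  --temp 0.0 \
  --image "$IMAGE" \
  --prompt "$PROMPT"

if [ "${RUN:-0}" = "1" ]; then
  # Execute the generation command.
  "$@"
else
  # Dry run: show the command that would be executed.
  echo "$@"
fi
```

Keeping `--temp 0.0` makes decoding greedy, so repeated runs on the same image and prompt should give the same output.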

Model size: 245M params (Safetensors)
Tensor types: FP16, U32, F32