mlx-community/nanoLLaVA-1.5-4bit

This model was converted to MLX format from qnguyen3/nanoLLaVA-1.5 using mlx-vlm version 0.0.11. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/nanoLLaVA-1.5-4bit --max-tokens 100 --temp 0.0
Downloads last month
17
Safetensors
Model size
166M params
Tensor type
FP16
U32
F32
Inference Examples
Inference API (serverless) does not yet support mlx models for this pipeline type.