Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
AutoTrain Compatible
custom_code
4-bit precision
Merge
Eval Results
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
369
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
Updated
Sep 21
•
10.3k
•
18
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
Sep 21
•
3.68k
•
11
lmms-lab/LLaVA-Video-72B-Qwen2
Text Generation
•
Updated
Oct 25
•
1.37k
•
16
openvla/openvla-7b-finetuned-libero-10
Image-Text-to-Text
•
Updated
Oct 9
•
1.65k
•
1
Qwen/Qwen2-VL-7B
Image-Text-to-Text
•
Updated
19 days ago
•
8.65k
•
21
lmms-lab/llava-onevision-qwen2-72b-ov-chat
Image-Text-to-Text
•
Updated
Oct 9
•
7.64k
•
7
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
Oct 1
•
383
PKU-Alignment/AA-chameleon-7b-base
Any-to-Any
•
Updated
Sep 13
•
23
•
8
PKU-Alignment/AA-chameleon-7b-plus
Any-to-Any
•
Updated
Sep 13
•
45
•
4
lmms-lab/llava-onevision-qwen2-7b-ov-chat
Text Generation
•
Updated
Oct 23
•
3.24k
•
16
lmms-lab/LLaVA-Video-7B-Qwen2-Video-Only
Text Generation
•
Updated
Oct 4
•
4.41k
•
3
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
Updated
Sep 24
•
172k
•
16
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
Sep 24
•
2.17k
•
7
erax-ai/EraX-VL-7B-V1.0
Visual Question Answering
•
Updated
18 days ago
•
3.65k
•
28
allenai/MolmoE-1B-0924
Image-Text-to-Text
•
Updated
Oct 10
•
76.4k
•
133
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
Updated
Nov 15
•
78.4k
•
148
unsloth/Llama-3.2-11B-Vision
Image-Text-to-Text
•
Updated
Nov 22
•
3.7k
•
28
unsloth/Llama-3.2-11B-Vision-bnb-4bit
Image-Text-to-Text
•
Updated
Nov 22
•
642
•
14
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
16 days ago
•
105k
•
62
unsloth/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
16 days ago
•
40.4k
•
65
unsloth/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
•
Updated
Nov 22
•
1.44k
•
15
unsloth/Llama-3.2-90B-Vision-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
Nov 22
•
2.04k
•
16
mlx-community/Qwen2-VL-2B-Instruct-4bit
Image-Text-to-Text
•
Updated
Sep 27
•
490
•
2
mlx-community/Qwen2-VL-7B-Instruct-bf16
Image-Text-to-Text
•
Updated
Sep 28
•
134
•
4
mlx-community/Qwen2-VL-7B-Instruct-8bit
Image-Text-to-Text
•
Updated
Sep 28
•
200
•
2
mlx-community/Qwen2-VL-7B-4bit
Image-Text-to-Text
•
Updated
Sep 28
•
60
•
2
mlx-community/Qwen2-VL-7B-bf16
Image-Text-to-Text
•
Updated
Sep 28
•
38
•
1
mlx-community/Qwen2-VL-2B-4bit
Image-Text-to-Text
•
Updated
Sep 28
•
80
•
4
kiddobellamy/Llama_Vision
Video-Text-to-Text
•
Updated
Sep 28
•
23
•
1
Neurazum/Xbai-Epilepsy-1.0
Video-Text-to-Text
•
Updated
Nov 11
•
2
Previous
1
2
3
4
5
...
13
Next