Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
diwank
's Collections
search
Vision
Art
K
S1.1
Sam
Audio
thought
Vision
updated
about 5 hours ago
Upvote
-
apple/DepthPro
Depth Estimation
•
Updated
Oct 9
•
3.05k
•
370
rhymes-ai/Aria
Image-Text-to-Text
•
Updated
8 days ago
•
17.9k
•
600
mit-han-lab/hart-0.7b-1024px
Unconditional Image Generation
•
Updated
Nov 17
•
8
deepseek-ai/Janus-1.3B
Any-to-Any
•
Updated
Nov 14
•
7.77k
•
482
neulab/PangeaInstruct
Updated
Oct 25
•
856
•
78
genmo/mochi-1-preview
Text-to-Video
•
Updated
7 days ago
•
34.3k
•
1.12k
stabilityai/stable-diffusion-3.5-large
Text-to-Image
•
Updated
Oct 22
•
154k
•
•
1.69k
Freepik/flux.1-lite-8B-alpha
Text-to-Image
•
Updated
Oct 28
•
9.87k
•
402
microsoft/OmniParser
Image-Text-to-Text
•
Updated
24 days ago
•
4.33k
•
1.5k
mistralai/Pixtral-12B-Base-2409
Updated
Oct 30
•
69
neulab/Pangea-7B
Updated
Oct 24
•
6.19k
•
122
jadechoghari/Ferret-UI-Llama8b
Image-Text-to-Text
•
Updated
Oct 18
•
1.16k
•
55
OpenGVLab/InternVL2-1B
Image-Text-to-Text
•
Updated
8 days ago
•
55.5k
•
59
OpenGVLab/InternVL2-2B
Image-Text-to-Text
•
Updated
8 days ago
•
55.8k
•
64
OpenGVLab/Mono-InternVL-2B
Image-Text-to-Text
•
Updated
Nov 21
•
5.94k
•
29
OpenGVLab/OmniCorpus-YT
Updated
Nov 17
•
171
•
8
OpenGVLab/OmniCorpus-CC-210M
Viewer
•
Updated
Nov 17
•
208M
•
838
•
19
OpenGVLab/OmniCorpus-CC
Viewer
•
Updated
Nov 17
•
986M
•
22.9k
•
12
OpenGVLab/InternVideo2_chat_8B_HD
Video-Text-to-Text
•
Updated
8 days ago
•
811
•
15
OpenGVLab/ViCLIP
Updated
Jun 7
•
33
OpenGVLab/ASMv2
Text Generation
•
Updated
Feb 29
•
118
•
17
OpenGVLab/VideoChat2-IT
Viewer
•
Updated
Jun 29
•
1.82M
•
782
•
45
NimVideo/cogvideox-2b-img2vid
Image-to-Video
•
Updated
Oct 28
•
435
•
62
BAAI/Infinity-MM
Updated
13 days ago
•
34.1k
•
85
nvidia/RADIO-H
Updated
24 days ago
•
1.8k
•
9
Spawning/PD12M
Viewer
•
Updated
Nov 19
•
12.4M
•
3.19k
•
145
Shitao/OmniGen-v1
Text-to-Image
•
Updated
Nov 7
•
11.9k
•
270
InstantX/InstantIR
Image-to-Image
•
Updated
Nov 7
•
6
•
155
nvidia/Cosmos-Tokenizer-DI8x8
Updated
1 day ago
•
254
•
8
BAAI/Emu3-Chat
Text Generation
•
Updated
Oct 24
•
1.54k
•
71
briaai/RMBG-2.0
Image Segmentation
•
Updated
3 days ago
•
230k
•
534
Watermark Anything with Localized Messages
Paper
•
2411.07231
•
Published
Nov 11
•
20
rain1011/pyramid-flow-miniflux
Text-to-Video
•
Updated
Nov 13
•
153
OpenGVLab/InternVL2-8B-MPO
Image-Text-to-Text
•
Updated
6 days ago
•
3.24k
•
31
mistralai/Pixtral-Large-Instruct-2411
Image-Text-to-Text
•
Updated
20 days ago
•
373
briaai/BRIA-2.3
Text-to-Image
•
Updated
Nov 19
•
608
•
29
microsoft/Reducio-VAE
Updated
Nov 21
•
41
•
15
Lightricks/LTX-Video
Image-to-Video
•
Updated
7 days ago
•
75.5k
•
774
apple/aimv2-3B-patch14-448
Image Feature Extraction
•
Updated
28 days ago
•
343
•
8
THUdyh/Insight-V-Reason
Text Generation
•
Updated
Nov 22
•
50
•
9
black-forest-labs/FLUX.1-Fill-dev
Updated
about 1 month ago
•
65k
•
406
Efficient-Large-Model/Sana_1600M_512px
Text-to-Image
•
Updated
22 days ago
•
1k
•
37
Efficient-Large-Model/Sana_1600M_1024px
Text-to-Image
•
Updated
22 days ago
•
19k
•
146
AIDC-AI/Ovis1.6-Gemma2-27B
Image-Text-to-Text
•
Updated
16 days ago
•
1.39k
•
57
HuggingFaceTB/SmolVLM-Base
Image-Text-to-Text
•
Updated
28 days ago
•
11.7k
•
49
THUDM/glm-edge-v-5b
Image-Text-to-Text
•
Updated
27 days ago
•
352
•
11
rhymes-ai/Aria-Base-64K
Image-Text-to-Text
•
Updated
25 days ago
•
323
•
10
allenai/pixmo-point-explanations
Viewer
•
Updated
20 days ago
•
79.6k
•
553
•
6
tencent/HunyuanVideo
Text-to-Video
•
Updated
8 days ago
•
7.07k
•
1.28k
tencent/HunyuanVideo-PromptRewrite
Updated
20 days ago
•
306
•
36
google/paligemma2-28b-pt-896
Image-Text-to-Text
•
Updated
21 days ago
•
1.02k
•
36
OpenGVLab/InternVL2_5-78B
Image-Text-to-Text
•
Updated
8 days ago
•
4.36k
•
158
MAmmoTH-VL/MAmmoTH-VL-8B
Updated
17 days ago
•
210
•
14
MAmmoTH-VL/MAmmoTH-VL-Instruct-12M
Viewer
•
Updated
15 days ago
•
37M
•
4.56k
•
28
OpenGVLab/PVC-InternVL2-8B
Image-Text-to-Text
•
Updated
9 days ago
•
78
•
8
BGLab/BioTrove
Viewer
•
Updated
12 days ago
•
163M
•
61
•
4
TencentARC/NVComposer
Image-to-3D
•
Updated
10 days ago
•
148
•
6
deepseek-ai/deepseek-vl2
Image-Text-to-Text
•
Updated
8 days ago
•
781
•
103
FastVideo/FastHunyuan
Text-to-Video
•
Updated
8 days ago
•
368
•
113
BAAI/nova-d48w1536-sdxl1024
Text-to-Image
•
Updated
5 days ago
•
27
•
5
IamCreateAI/Ruyi-Mini-7B
Image-to-Video
•
Updated
about 19 hours ago
•
12.8k
•
460
Infinigence/Megrez-3B-Omni
Updated
9 days ago
•
485
•
116
microsoft/VidTok
Updated
about 24 hours ago
•
14
TIGER-Lab/Mantis-8B-siglip-llama3
Image-Text-to-Text
•
Updated
Nov 15
•
4.13k
•
32
OpenGVLab/HoVLE-HD
Image-Text-to-Text
•
Updated
1 day ago
•
48
•
6
Upvote
-
Share collection
View history
Collection guide
Browse collections