microsoft/OmniParser
Tags: Image-Text-to-Text, Transformers, Safetensors, blip-2, visual-question-answering, Inference Endpoints
arXiv: 2408.00203
License: MIT
Branch: main
Path: OmniParser/icon_caption_florence
3 contributors, 2 commits
Latest commit: 7652a5a by adamlu1, "update readme, add safetensor", 2 months ago
File                      Scan   Size             Last commit                      Updated
LICENSE                   Safe   1.14 kB          update readme, add safetensor    2 months ago
config.json               Safe   5.66 kB          add florence model               2 months ago
generation_config.json    Safe   292 Bytes        add florence model               2 months ago
model.safetensors         Safe   1.08 GB (LFS)    add florence model               2 months ago