Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
v1v1d
/
vqa_full_lora_64
like
0
Follow
ViViD
3
Image-Text-to-Text
Transformers
Safetensors
multilingual
GOT
feature-extraction
got
vision-language
ocr2.0
custom_code
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Use this model
Edit model card
GOT OCR v1 hi
Downloads last month
4
Safetensors
Model size
716M params
Tensor type
BF16
·
Inference API
Image-Text-to-Text
Inference API (serverless) does not yet support model repos that contain custom code.