Model Card

This is Qwen2-VL 2B, fine-tuned for OCR/HTR with Spanish language historical documents using data from neulab/PangeaInstruct. Each image has a red box around an area of text in the image. The model is asked to return the text inside.

For the training data see

Model Details

This is the model card of a 🤗 transformers model that has been pushed on the Hub.

  • Developed by: Andrew Janco
  • Model type: Qwen2-VL
  • Language(s) (NLP): Spanish
  • License: MIT
  • Finetuned from model [optional]: Qwen2-VL 2B

Uses

This model is part of experiments to extract text from historical handwritten documents.

Downloads last month
21
Safetensors
Model size
2.21B params
Tensor type
BF16
·
Inference Examples
Inference API (serverless) does not yet support transformers models for this pipeline type.

Model tree for apjanco/es_qwen2_vl_pangea

Base model

Qwen/Qwen2-VL-2B
Finetuned
(53)
this model

Datasets used to train apjanco/es_qwen2_vl_pangea