Model Card

This is Qwen2-VL 2B, fine-tuned for OCR/HTR with Spanish language historical documents using data from neulab/PangeaInstruct. Each image has a red box around an area of text in the image. The model is asked to return the text inside.

For the training data see

Pangea (task_data_vmultilingual_cc_news_es_curated.tar)
apjanco/fmb_primera_muestra_redboxes

Model Details

This is the model card of a 🤗 transformers model that has been pushed on the Hub.

Developed by: Andrew Janco
Model type: Qwen2-VL
Language(s) (NLP): Spanish
License: MIT
Finetuned from model [optional]: Qwen2-VL 2B

Uses

This model is part of experiments to extract text from historical handwritten documents.

apjanco
/

es_qwen2_vl_pangea

Model Card

Model Details

Uses

Model tree for apjanco/es_qwen2_vl_pangea

Datasets used to train apjanco/es_qwen2_vl_pangea