Document Processing Collection Any model or dataset dealing with document-type objects (layout detection, VQA, OCR, etc.) • 9 items • Updated Nov 14 • 3
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 12 days ago • 81
Evaluation Datasets Collection Collection of Romanian datasets used for evaluation • 8 items • Updated Oct 11 • 1
SFT Datasets Collection Collection of Romanian datasets used for supervised fine-tuning • 9 items • Updated Oct 11 • 1
MultiLegalPile Models Collection A 689GB Multilingual Legal Corpus • 33 items • Updated Oct 23, 2023 • 1