DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 77
Evaluation Datasets Collection Collection of Romanian datasets used for evaluation • 8 items • Updated Oct 11 • 1
SFT Datasets Collection Collection of Romanian datasets used for supervised finetuning • 9 items • Updated Oct 11 • 1
MultiLegalPile Models Collection A 689GB Multilingual Legal Corpus • 33 items • Updated Oct 23, 2023 • 1