--- license: bigscience-bloom-rail-1.0 datasets: - swap-uniba/itwiki-march-2024 language: - it tags: - bloom - italian --- # Model Card for Model ID The model is obtained by performing language adaptation on the original bloom-1b7 model. In detail, we continued the pre-training on Italian-specific data without adaptation of the vocabulary. We use about 2.8M documents obtained from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024). The model is trained for one epoch using LoRA and SFT. ## Model Details ### Model Description - **Developed by:** SWAP Research Group, Department of Computer Science, University of Bari Aldo Moro - **Model type:** BLOOM - **Language(s) (NLP):** Italian - **License:** bigscience-bloom-rail-1.0 - **Finetuned from model [optional]:** bloom-1b7 ## Training Details ### Training Data 2.8M documents obtained from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024). ### Training Procedure LoRA and SFT. #### Training Hyperparameters - **Training regime:** fp16 ## Citation [optional] **BibTeX:** **APA:** ## Model Card Authors [optional] Pierpaolo Basile, University of Bari Aldo Moro, Italy. ## Model Card Contact Pierpaolo Basile, University of Bari Aldo Moro, Italy.