bloom-1b7-it / README.md
basilepp19's picture
Update README.md
baf0b44 verified
---
license: bigscience-bloom-rail-1.0
datasets:
- swap-uniba/itwiki-march-2024
language:
- it
tags:
- bloom
- italian
---
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
The model is obtained by performing language adaptation on the original bloom-1b7 model.
In detail, we continued the pre-training on Italian-specific data without adaptation of the vocabulary.
We use about 2.8M documents obtained from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024). The model is trained for one epoch using LoRA and SFT.
## Model Details
### Model Description
<!-- Provide a longer summary of what this model is. -->
- **Developed by:** SWAP Research Group, Department of Computer Science, University of Bari Aldo Moro
- **Model type:** BLOOM
- **Language(s) (NLP):** Italian
- **License:** bigscience-bloom-rail-1.0
- **Finetuned from model [optional]:** bloom-1b7
## Training Details
### Training Data
2.8M documents obtained from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024).
### Training Procedure
LoRA and SFT.
#### Training Hyperparameters
- **Training regime:** fp16
## Citation [optional]
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
**BibTeX:**
**APA:**
## Model Card Authors [optional]
Pierpaolo Basile, University of Bari Aldo Moro, Italy.
## Model Card Contact
Pierpaolo Basile, University of Bari Aldo Moro, Italy.