File size: 679 Bytes
1a542da 431bf32 1a542da 431bf32 |
1 2 3 4 5 6 7 8 9 |
[CodeParrot](https://huggingface.co/lvwerra/codeparrot) uses GPT-2 architecture with BPE tokenizer trained on Python code. We released this model as an educational tool for training large language models from scratch on code, with detailed tutorials and descriptions of the training process. It makes use of [Accelerate](https://huggingface.co/docs/accelerate/index) for distributed training and mixed precision. See this [blog](https://huggingface.co/blog/codeparrot) and [repo](https://github.com/huggingface/transformers/tree/main/examples/research_projects/codeparrot) for more details.
<center>
|Model | # parameters |
| - | - |
| GPT2 | 1.5B |
</center> |