This is a GPT-NeoX model trained on 50 billion tokens from The Pile, using the Online Data Mixing method.
The OpenLLM leaderboard won't let me submit my model because the description is too short, so I'm adding more characters to the description in hopes that it will be evaluated.
- Downloads last month
- 145
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.