This is a GPT-NeoX model trained on 50 billion tokens from The Pile, using the Online Data Mixing method.
The OpenLLM leaderboard won't let me submit my model because the description is too short, so I'm adding more characters to the description in hopes that it will be evaluated.
- Downloads last month
- 70
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.