Edit model card

Nano Llama of 1.7B parameters.

Built with BuildNanoGPT-Plus.

Trained on 18.1B tokens.

Evaluation

Hellaswag: 0.5243

Validation loss of Cross Entropy: 2.5

License

This model is available under the Apache 2.0 License.

Discord Server

Join our Discord server here.

Feeling Generous? 😊

Eager to buy me a cup of 2$ coffee or iced tea?πŸ΅β˜• Sure, here is the link: https://ko-fi.com/drnicefellow. Please add a note on which one you want me to drink?

Downloads last month
2
Safetensors
Model size
1.72B params
Tensor type
F32
Β·
Inference API
Unable to determine this model's library. Check the docs .

Collections including DrNicefellow/Llama-2b-36d2k_steps