Llama-2b-from-scratch
Collection
13 items
β’
Updated
Nano Llama of 1.7B parameters.
Built with BuildNanoGPT-Plus.
Trained on 18.1B tokens.
Hellaswag: 0.5243
Validation loss of Cross Entropy: 2.5
This model is available under the Apache 2.0 License.
Join our Discord server here.
Eager to buy me a cup of 2$ coffee or iced tea?π΅β Sure, here is the link: https://ko-fi.com/drnicefellow. Please add a note on which one you want me to drink?