core1-base-464m-c4 / README.md

crumb

Create README.md

b9ef701 about 1 year ago

preview code

raw

history blame

261 Bytes

metadata

datasets:
  - c4
language:
  - en

CoreX models are Llama models in which the first X decoder layers are kept, and then the model is finetuned on 1 billion tokens from some dataset. Base model stems from Llama2-7b, medium from Llama2-13b, xl from Llama2-70b.