license: apache-2.0
datasets:
  - 64bits/lima_vicuna_format

llama3-8B-lima

Built with Axolotl

Supervised fine-tuning (SFT) on the 64bits/lima_vicuna_format dataset for 3 epochs using QLoRA.

Model Details

  • Trained by: HenryJJ
  • Model type: llama3-8B-lima is an auto-regressive language model based on the Llama 3 transformer architecture.
  • Language(s): English
  • License for llama3-8B-lima: apache-2.0

Prompting

This model uses the ChatML prompt format:

<|im_start|>system
You are a helpful AI assistant.<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant

Example:

<|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
who is the president of us<|im_end|>
<|im_start|>assistant
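
The ChatML layout above can be assembled programmatically. The following is a minimal sketch, not part of the original model card: the helper name `build_chatml_prompt` and its default system message are illustrative assumptions. It only builds the prompt string and does not load or call the model.

```python
# Special tokens as they appear in the ChatML template in this README.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_chatml_prompt(user_prompt, system_prompt="You are a helpful AI assistant."):
    """Format a single-turn ChatML prompt (hypothetical helper).

    The returned string ends with an open assistant turn, so the model's
    generation continues from there.
    """
    return (
        f"{IM_START}system\n{system_prompt}{IM_END}\n"
        f"{IM_START}user\n{user_prompt}{IM_END}\n"
        f"{IM_START}assistant\n"
    )


prompt = build_chatml_prompt("who is the president of us")
print(prompt)
```

When generating from a prompt like this, it is usually sensible to stop at `<|im_end|>`, since that token closes the assistant turn in this format.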