--- license: apache-2.0 language: - zh library_name: transformers tags: - llama - llama3 - dpo-zh - emoji datasets: - shareAI/DPO-zh-en-emoji --- Github:https://github.com/CrazyBoyM/llama3-Chinese-chat 放出训练配方细节供网友参考分享: DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练。 偏好中文和emoji表情,且不损伤原instruct版模型能力。 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/631f5b422225f12fc0f2c838/2xlWxZvN0gahckA2EPmlE.png) Git下载 ``` #Git模型下载 git clone https://www.modelscope.cn/baicai003/Llama3-Chinese-instruct-DPO-beta0.5.git ```