Baicai003 commited on
Commit
f95efe0
1 Parent(s): aff3863

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -3
README.md CHANGED
@@ -1,3 +1,46 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - shareAI/DPO-zh-en-emoji
5
+ language:
6
+ - zh
7
+ - en
8
+ pipeline_tag: question-answering
9
+ tags:
10
+ - dpo
11
+ - llama3.1
12
+ - llama3
13
+ - chat
14
+ ---
15
+ # llama3-instruct 中文DPO版
16
+ ### 模型介绍
17
+ 像原版instruct一样,喜欢用有趣中文和表情符号回答问题。
18
+ Github:https://github.com/CrazyBoyM/llama3-Chinese-chat
19
+ 放出训练配方细节供网友参考分享:
20
+ DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.
21
+ 特点:偏好中文和emoji表情,且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3中文微调版 (微调会破坏llama3原版能力,导致遗忘)
22
+ ![Alt text](image.png)
23
+
24
+ ### 模型部署
25
+ 网页脚本文件:https://github.com/CrazyBoyM/llama3-Chinese-chat/blob/main/deploy/web_streamlit_for_instruct_v2.py
26
+ ```
27
+ pip install streamlit
28
+ streamlit run web_streamlit_for_instruct_v2.py ./Llama3-Chinese-instruct-DPO-beta0.5
29
+ ```
30
+
31
+ ### 模型下载
32
+ SDK下载
33
+ ```bash
34
+ #安装ModelScope
35
+ pip install modelscope
36
+ ```
37
+ ```python
38
+ #SDK模型下载
39
+ from modelscope import snapshot_download
40
+ model_dir = snapshot_download('baicai003/Llama3-Chinese-instruct-DPO-beta0.5')
41
+ ```
42
+ Git下载
43
+ ```
44
+ #Git模型下载
45
+ git clone https://www.modelscope.cn/baicai003/Llama3-Chinese-instruct-DPO-beta0.5.git
46
+ ```