Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,46 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- shareAI/DPO-zh-en-emoji
|
5 |
+
language:
|
6 |
+
- zh
|
7 |
+
- en
|
8 |
+
pipeline_tag: question-answering
|
9 |
+
tags:
|
10 |
+
- dpo
|
11 |
+
- llama3.1
|
12 |
+
- llama3
|
13 |
+
- chat
|
14 |
+
---
|
15 |
+
# llama3-instruct 中文DPO版
|
16 |
+
### 模型介绍
|
17 |
+
像原版instruct一样,喜欢用有趣中文和表情符号回答问题。
|
18 |
+
Github:https://github.com/CrazyBoyM/llama3-Chinese-chat
|
19 |
+
放出训练配方细节供网友参考分享:
|
20 |
+
DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.
|
21 |
+
特点:偏好中文和emoji表情,且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3中文微调版 (微调会破坏llama3原版能力,导致遗忘)
|
22 |
+
![Alt text](image.png)
|
23 |
+
|
24 |
+
### 模型部署
|
25 |
+
网页脚本文件:https://github.com/CrazyBoyM/llama3-Chinese-chat/blob/main/deploy/web_streamlit_for_instruct_v2.py
|
26 |
+
```
|
27 |
+
pip install streamlit
|
28 |
+
streamlit run web_streamlit_for_instruct_v2.py ./Llama3-Chinese-instruct-DPO-beta0.5
|
29 |
+
```
|
30 |
+
|
31 |
+
### 模型下载
|
32 |
+
SDK下载
|
33 |
+
```bash
|
34 |
+
#安装ModelScope
|
35 |
+
pip install modelscope
|
36 |
+
```
|
37 |
+
```python
|
38 |
+
#SDK模型下载
|
39 |
+
from modelscope import snapshot_download
|
40 |
+
model_dir = snapshot_download('baicai003/Llama3-Chinese-instruct-DPO-beta0.5')
|
41 |
+
```
|
42 |
+
Git下载
|
43 |
+
```
|
44 |
+
#Git模型下载
|
45 |
+
git clone https://www.modelscope.cn/baicai003/Llama3-Chinese-instruct-DPO-beta0.5.git
|
46 |
+
```
|