wenge-research
commited on
Commit
•
0625bed
1
Parent(s):
c9b8c63
Update README.md
Browse files
README.md
CHANGED
@@ -16,15 +16,15 @@ license: other
|
|
16 |
|
17 |
|
18 |
## 介绍/Introduction
|
19 |
-
YAYI 2 是中科闻歌研发的开源大语言模型,包括 Base 和 Chat 版本,参数规模为 30B。YAYI2-30B 是基于 Transformer 的大语言模型,采用了 2.65 万亿 Tokens
|
20 |
|
21 |
-
|
22 |
|
23 |
|
24 |
|
25 |
-
YAYI 2 is a collection of open-source large language models launched by Wenge Technology. YAYI2-30B is a Transformer-based large language model, and has been pretrained for 2.65 trillion tokens of multilingual data with high quality. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF).
|
26 |
|
27 |
-
For more details about the YAYI 2, please refer to our
|
28 |
|
29 |
|
30 |
## 模型细节/Model Details
|
@@ -42,7 +42,7 @@ For more details about the YAYI 2, please refer to our GitHub repository. Stay t
|
|
42 |
|
43 |
* python 3.8及以上版本
|
44 |
* pytorch 2.0.1 及以上版本
|
45 |
-
* 建议使用 CUDA 11.7
|
46 |
* 运行 BF16 或 FP16 模型需要至少80GB显存(例如1xA100)
|
47 |
|
48 |
|
@@ -222,7 +222,7 @@ We evaluate our model on standard benchmarks, including C-Eval, MMLU, CMMLU, AGI
|
|
222 |
<tr>
|
223 |
<td><strong>YAYI2-30B</strong></td>
|
224 |
<td style="text-align: center;">80.9</td>
|
225 |
-
<td style="text-align: center;">80.5</td>
|
226 |
<td style="text-align: center;"><b>62.0</b></td>
|
227 |
<td style="text-align: center;"><b>84.0</b></td>
|
228 |
<td style="text-align: center;">64.4</td>
|
@@ -243,7 +243,7 @@ We evaluate our model using the source code from the [OpenCompass Github reposit
|
|
243 |
|
244 |
## 协议/License
|
245 |
|
246 |
-
本项目中的代码依照 [Apache-2.0](LICENSE) 协议开源,社区使用 YAYI 2 模型和数据需要遵循[雅意YAYI 2 模型社区许可协议](YAYI2_Community_License)。若您需要将雅意 YAYI 2系列模型或其衍生品用作商业用途,请根据[《雅意 YAYI 2 模型商用许可协议》](YAYI2_Commercial_License)将商用许可申请登记信息发送至指定邮箱[email protected]。审核通过后,雅意将授予您商用版权许可,请遵循协议中的商业许可限制。
|
247 |
|
248 |
The code in this project is open-sourced under the [Apache-2.0](LICENSE) license. The use of YaYi series model weights and data must adhere to the [YAYI 2 Community License](YAYI2_Community_License). If you intend to use the YAYI 2 series models or their derivatives for commercial purposes, please submit your commercial license application and registration information to [email protected], following the [YAYI 2 Commercial License](YAYI2_Commercial_License). Upon approval, YAYI will grant you a commercial copyright license, subject to the commercial license restrictions outlined in the agreement.
|
249 |
|
@@ -257,7 +257,7 @@ If you are using the resource for your work, please cite our paper.
|
|
257 |
|
258 |
```
|
259 |
@article{YAYI 2,
|
260 |
-
author = {Yin Luo, Qingchao Kong, Nan Xu, et.al.}
|
261 |
title = {YAYI 2: Multilingual Open Source Large Language Models},
|
262 |
journal = {arXiv preprint arXiv},
|
263 |
year = {2023}
|
|
|
16 |
|
17 |
|
18 |
## 介绍/Introduction
|
19 |
+
YAYI 2 是中科闻歌研发的开源大语言模型,包括 Base 和 Chat 版本,参数规模为 30B。YAYI2-30B 是基于 Transformer 的大语言模型,采用了 2.65 万亿 Tokens 的高质量、多语言语料进行预训练。针对通用和特定领域的应用场景,我们采用了百万级指令进行微调,同时借助人类反馈强化学习方法,以更好地使模型与人类价值观对齐。
|
20 |
|
21 |
+
本次开源的模型为 YAYI2-30B Base 模型。如果您想了解更多关于 YAYI 2 模型的细节,我们建议您参阅 [GitHub](https://github.com/wenge-research/YAYI2) 仓库。更多技术细节,敬请期待我们的技术报告🔥。
|
22 |
|
23 |
|
24 |
|
25 |
+
YAYI 2 is a collection of open-source large language models launched by Wenge Technology. YAYI2-30B is a Transformer-based large language model, and has been pretrained for 2.65 trillion tokens of multilingual data with high quality. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF).
|
26 |
|
27 |
+
We opensource the pre-trained language model in this release, namely **YAYI2-30B**. For more details about the YAYI 2, please refer to our [GitHub](https://github.com/wenge-research/YAYI2) repository. Stay tuned for more technical details in our upcoming technical report! 🔥
|
28 |
|
29 |
|
30 |
## 模型细节/Model Details
|
|
|
42 |
|
43 |
* python 3.8及以上版本
|
44 |
* pytorch 2.0.1 及以上版本
|
45 |
+
* 建议使用 CUDA 11.7 及以上版本
|
46 |
* 运行 BF16 或 FP16 模型需要至少80GB显存(例如1xA100)
|
47 |
|
48 |
|
|
|
222 |
<tr>
|
223 |
<td><strong>YAYI2-30B</strong></td>
|
224 |
<td style="text-align: center;">80.9</td>
|
225 |
+
<td style="text-align: center;"><b>80.5</b></td>
|
226 |
<td style="text-align: center;"><b>62.0</b></td>
|
227 |
<td style="text-align: center;"><b>84.0</b></td>
|
228 |
<td style="text-align: center;">64.4</td>
|
|
|
243 |
|
244 |
## 协议/License
|
245 |
|
246 |
+
本项目中的代码依照 [Apache-2.0](LICENSE) 协议开源,社区使用 YAYI 2 模型和数据需要遵循[雅意YAYI 2 模型社区许可协议](YAYI2_Community_License)。若您需要将雅意 YAYI 2系列模型或其衍生品用作商业用途,请根据[《雅意 YAYI 2 模型商用许可协议》](YAYI2_Commercial_License)将商用许可申请登记信息发送至指定邮箱 [email protected]。审核通过后,雅意将授予您商用版权许可,请遵循协议中的商业许可限制。
|
247 |
|
248 |
The code in this project is open-sourced under the [Apache-2.0](LICENSE) license. The use of YaYi series model weights and data must adhere to the [YAYI 2 Community License](YAYI2_Community_License). If you intend to use the YAYI 2 series models or their derivatives for commercial purposes, please submit your commercial license application and registration information to [email protected], following the [YAYI 2 Commercial License](YAYI2_Commercial_License). Upon approval, YAYI will grant you a commercial copyright license, subject to the commercial license restrictions outlined in the agreement.
|
249 |
|
|
|
257 |
|
258 |
```
|
259 |
@article{YAYI 2,
|
260 |
+
author = {Yin Luo, Qingchao Kong, Nan Xu, et.al.},
|
261 |
title = {YAYI 2: Multilingual Open Source Large Language Models},
|
262 |
journal = {arXiv preprint arXiv},
|
263 |
year = {2023}
|