Commit 3f2f61d, committed by x54-729
1 parent: 805c395
update opencompass leaderboard url
README.md CHANGED
@@ -45,7 +45,7 @@ We release the SFT version so that the community can study the influence of RLHF
 
 ### Performance Evaluation
 
-We conducted a comprehensive evaluation of InternLM2 using the open-source evaluation tool [OpenCompass](https://github.com/internLM/OpenCompass/). The evaluation covered five dimensions of capabilities: disciplinary competence, language competence, knowledge competence, inference competence, and comprehension competence. Here are some of the evaluation results, and you can visit the [OpenCompass leaderboard](https://opencompass.org.cn/
+We conducted a comprehensive evaluation of InternLM2 using the open-source evaluation tool [OpenCompass](https://github.com/internLM/OpenCompass/). The evaluation covered five dimensions of capabilities: disciplinary competence, language competence, knowledge competence, inference competence, and comprehension competence. Here are some of the evaluation results, and you can visit the [OpenCompass leaderboard](https://rank.opencompass.org.cn/leaderboard-llm) for more evaluation results.
 
 | Dataset\Models | InternLM2-7B | InternLM2-Chat-7B | InternLM2-20B | InternLM2-Chat-20B | ChatGPT | GPT-4 |
 | --- | --- | --- | --- | --- | --- | --- |
@@ -200,7 +200,7 @@ InternLM2-Chat-20B-SFT 基于 InternLM2-Base-20B 经过有监督微调(SFT)
 
 ### 性能评测
 
-我们使用开源评测工具 [OpenCompass](https://github.com/internLM/OpenCompass/) 从学科综合能力、语言能力、知识能力、推理能力、理解能力五大能力维度对InternLM开展全面评测,部分评测结果如下表所示,欢迎访问[ OpenCompass 榜单 ](https://opencompass.org.cn/
+我们使用开源评测工具 [OpenCompass](https://github.com/internLM/OpenCompass/) 从学科综合能力、语言能力、知识能力、推理能力、理解能力五大能力维度对InternLM开展全面评测,部分评测结果如下表所示,欢迎访问[ OpenCompass 榜单 ](https://rank.opencompass.org.cn/leaderboard-llm)获取更多的评测结果。
 
 | 评测集\模型 | InternLM2-7B | InternLM2-Chat-7B | InternLM2-20B | InternLM2-Chat-20B | ChatGPT | GPT-4 |
 | --- | --- | --- | --- | --- | --- | --- |