JustinLin610
commited on
Commit
•
b07efc4
1
Parent(s):
2292f49
Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,6 @@ tags:
|
|
16 |
|
17 |
Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:
|
18 |
|
19 |
-
- Pretrained on our **latest large-scale dataset**, encompassing up to **18T tokens**.
|
20 |
- Significantly **more knowledge** and has greatly improved capabilities in **coding** and **mathematics**, thanks to our specialized expert models in these domains.
|
21 |
- Significant improvements in **instruction following**, **generating long texts** (over 8K tokens), **understanding structured data** (e.g, tables), and **generating structured outputs** especially JSON. **More resilient to the diversity of system prompts**, enhancing role-play implementation and condition-setting for chatbots.
|
22 |
- **Long-context Support** up to 128K tokens and can generate up to 8K tokens.
|
@@ -91,7 +90,7 @@ For deployment, we recommend using vLLM. Please refer to our [Github](https://gi
|
|
91 |
|
92 |
**Note**: Presently, vLLM only supports static YARN, which means the scaling factor remains constant regardless of input length, **potentially impacting performance on shorter texts**. We advise adding the `rope_scaling` configuration only when processing long contexts is required.
|
93 |
|
94 |
-
##
|
95 |
|
96 |
Detailed evaluation results are reported in this [📑 blog](https://qwenlm.github.io/blog/qwen2.5/).
|
97 |
|
|
|
16 |
|
17 |
Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:
|
18 |
|
|
|
19 |
- Significantly **more knowledge** and has greatly improved capabilities in **coding** and **mathematics**, thanks to our specialized expert models in these domains.
|
20 |
- Significant improvements in **instruction following**, **generating long texts** (over 8K tokens), **understanding structured data** (e.g, tables), and **generating structured outputs** especially JSON. **More resilient to the diversity of system prompts**, enhancing role-play implementation and condition-setting for chatbots.
|
21 |
- **Long-context Support** up to 128K tokens and can generate up to 8K tokens.
|
|
|
90 |
|
91 |
**Note**: Presently, vLLM only supports static YARN, which means the scaling factor remains constant regardless of input length, **potentially impacting performance on shorter texts**. We advise adding the `rope_scaling` configuration only when processing long contexts is required.
|
92 |
|
93 |
+
## Evaluation & Performance
|
94 |
|
95 |
Detailed evaluation results are reported in this [📑 blog](https://qwenlm.github.io/blog/qwen2.5/).
|
96 |
|