Update README.md
Browse files
README.md
CHANGED
@@ -30,10 +30,12 @@ Key Highlights:
|
|
30 |
- Best of N: By leveraging a combination of response sampling and Best-of-N strategies, we choose the response of top score judged by reward model, yielding better results with spending more inference time. For example, Qwen2.5-Math-1.5B-Instruct obtains 83.9 on MATH in RM@8 setting and even surpasses the performance of Qwen2.5-Math-7B-Instruct 83.6 with greedy decoding.
|
31 |
- Comparasion with majority voting (Maj@N): RM@N scores are substantially better than Maj@N scores aross almost all benchmarks and models.
|
32 |
|
|
|
|
|
33 |
|
34 |
## Model Details
|
35 |
|
36 |
-
For more details, please refer to our [blog post](https://qwenlm.github.io/blog/qwen2-math/) and [GitHub repo](https://github.com/QwenLM/Qwen2-Math).
|
37 |
|
38 |
|
39 |
## Requirements
|
@@ -46,7 +48,7 @@ For more details, please refer to our [blog post](https://qwenlm.github.io/blog/
|
|
46 |
> </b>
|
47 |
> </div>
|
48 |
|
49 |
-
For requirements on GPU memory and the respective throughput, see similar results of Qwen2
|
50 |
|
51 |
## Quick Start
|
52 |
|
|
|
30 |
- Best of N: By leveraging a combination of response sampling and Best-of-N strategies, we choose the response of top score judged by reward model, yielding better results with spending more inference time. For example, Qwen2.5-Math-1.5B-Instruct obtains 83.9 on MATH in RM@8 setting and even surpasses the performance of Qwen2.5-Math-7B-Instruct 83.6 with greedy decoding.
|
31 |
- Comparasion with majority voting (Maj@N): RM@N scores are substantially better than Maj@N scores aross almost all benchmarks and models.
|
32 |
|
33 |
+
![](http://qianwen-res.oss-accelerate-overseas.aliyuncs.com/Qwen2.5/qwen2.5-math-pipeline.jpeg)
|
34 |
+
|
35 |
|
36 |
## Model Details
|
37 |
|
38 |
+
For more details, please refer to our [blog post](https://qwenlm.github.io/blog/qwen2.5-math/) and [GitHub repo](https://github.com/QwenLM/Qwen2.5-Math).
|
39 |
|
40 |
|
41 |
## Requirements
|
|
|
48 |
> </b>
|
49 |
> </div>
|
50 |
|
51 |
+
For requirements on GPU memory and the respective throughput, see similar results of Qwen2 [here](https://qwen.readthedocs.io/en/latest/benchmark/speed_benchmark.html).
|
52 |
|
53 |
## Quick Start
|
54 |
|