Commit 7132050, committed by yuexiang96
1 Parent(s): 94c2d88

Update README.md

README.md CHANGED
@@ -5,7 +5,9 @@ datasets:
 language:
 - en
 ---
-# MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
+# 🦣 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
+
+Project Page: [https://tiger-ai-lab.github.io/MAmmoTH/](https://tiger-ai-lab.github.io/MAmmoTH/)
 
 Paper: [https://arxiv.org/pdf/2309.05653.pdf](https://arxiv.org/pdf/2309.05653.pdf)
 
@@ -13,14 +15,14 @@ Code: [https://github.com/TIGER-AI-Lab/MAmmoTH](https://github.com/TIGER-AI-Lab/
 
 
 ## Introduction
-We introduce 🦣MAmmoTH, a series of open-source large language models (LLMs) specifically tailored for general math problem-solving. The MAmmoTH models are trained on 🤗 [MathInstruct Dataset](https://huggingface.co/datasets/TIGER-Lab/MathInstruct), a meticulously curated instruction tuning dataset that is lightweight yet generalizable. MathInstruct is compiled from 13 math rationale datasets, six of which are newly curated by this work. It uniquely focuses on the hybrid use of chain-of-thought (CoT) and program-of-thought (PoT) rationales, and ensures extensive coverage of diverse mathematical fields.
+We introduce 🦣 MAmmoTH, a series of open-source large language models (LLMs) specifically tailored for general math problem-solving. The MAmmoTH models are trained on 🤗 [MathInstruct Dataset](https://huggingface.co/datasets/TIGER-Lab/MathInstruct), a meticulously curated instruction tuning dataset that is lightweight yet generalizable. MathInstruct is compiled from 13 math rationale datasets, six of which are newly curated by this work. It uniquely focuses on the hybrid use of chain-of-thought (CoT) and program-of-thought (PoT) rationales, and ensures extensive coverage of diverse mathematical fields.
 
 | | **Base Model: Llama-2** | **Base Model: Code Llama** |
 |-----|---------------------------------------------------------------|--------------------------------------------------------------------------|
-| 7B | 🦣[MAmmoTH-7B](https://huggingface.co/TIGER-Lab/MAmmoTH-7B) | 🦣[MAmmoTH-Coder-7B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-7B) |
-| 13B | 🦣[MAmmoTH-13B](https://huggingface.co/TIGER-Lab/MAmmoTH-13B) | 🦣[MAmmoTH-Coder-13B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-13B)|
-| 34B | - | 🦣[MAmmoTH-Coder-34B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-34B)|
-| 70B | 🦣[MAmmoTH-70B](https://huggingface.co/TIGER-Lab/MAmmoTH-70B) | - |
+| 7B | 🦣 [MAmmoTH-7B](https://huggingface.co/TIGER-Lab/MAmmoTH-7B) | 🦣 [MAmmoTH-Coder-7B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-7B) |
+| 13B | 🦣 [MAmmoTH-13B](https://huggingface.co/TIGER-Lab/MAmmoTH-13B) | 🦣 [MAmmoTH-Coder-13B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-13B)|
+| 34B | - | 🦣 [MAmmoTH-Coder-34B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-34B)|
+| 70B | 🦣 [MAmmoTH-70B](https://huggingface.co/TIGER-Lab/MAmmoTH-70B) | - |
 
 
 