Update README.md
README.md
CHANGED
@@ -14,7 +14,7 @@ widget:
 
 ## MiniMA-3B
 
-[arXiv]() | [GitHub](https://github.com/GeneZC/MiniMA) | [HuggingFace-MiniMA](https://huggingface.co/GeneZC/MiniMA-3B) | [HuggingFace-MiniChat](https://huggingface.co/GeneZC/MiniChat-3B) | [ModelScope-MiniMA](https://modelscope.cn/models/GeneZC/MiniMA-3B) | [ModelScope-MiniChat](https://modelscope.cn/models/GeneZC/MiniChat-3B)
+[arXiv](https://arxiv.org/abs/2311.07052) | [GitHub](https://github.com/GeneZC/MiniMA) | [HuggingFace-MiniMA](https://huggingface.co/GeneZC/MiniMA-3B) | [HuggingFace-MiniChat](https://huggingface.co/GeneZC/MiniChat-3B) | [ModelScope-MiniMA](https://modelscope.cn/models/GeneZC/MiniMA-3B) | [ModelScope-MiniChat](https://modelscope.cn/models/GeneZC/MiniChat-3B)
 
 Must comply with LICENSE of LLaMA2 since it is derived from LLaMA2.
 
@@ -58,6 +58,6 @@ output = tokenizer.decode(output_ids, skip_special_tokens=True).strip()
 title={Towards the Law of Capacity Gap in Distilling Language Models},
 author={Zhang, Chen and Song, Dawei and Ye, Zheyu and Gao, Yan},
 year={2023},
-url={}
+url={https://arxiv.org/abs/2311.07052}
 }
 ```