Muennighoff
commited on
Commit
•
a880778
1
Parent(s):
928e583
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,27 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
datasets:
|
3 |
+
- bigscience/xP3
|
4 |
+
- bs-la/xP3ru
|
5 |
+
license: bigscience-bloom-rail-1.0
|
6 |
+
---
|
7 |
+
|
8 |
+
# Model Summary
|
9 |
+
|
10 |
+
[bloom-7b1](https://huggingface.co/bigscience/bloom-7b1) finetuned on xP3 enhanced with Russian multitask data. Hence the same as [bloomz-7b1](https://huggingface.co/bigscience/bloomz-7b1), but with additional Russian finetuning data. 4b stands for 4 billion finetuning tokens (same as bloomz-7b1).
|
11 |
+
|
12 |
+
# Citation
|
13 |
+
|
14 |
+
```
|
15 |
+
BLOOM+1 - TODO
|
16 |
+
```
|
17 |
+
|
18 |
+
```bibtex
|
19 |
+
@misc{muennighoff2022crosslingual,
|
20 |
+
title={Crosslingual Generalization through Multitask Finetuning},
|
21 |
+
author={Niklas Muennighoff and Thomas Wang and Lintang Sutawika and Adam Roberts and Stella Biderman and Teven Le Scao and M Saiful Bari and Sheng Shen and Zheng-Xin Yong and Hailey Schoelkopf and Xiangru Tang and Dragomir Radev and Alham Fikri Aji and Khalid Almubarak and Samuel Albanie and Zaid Alyafeai and Albert Webson and Edward Raff and Colin Raffel},
|
22 |
+
year={2022},
|
23 |
+
eprint={2211.01786},
|
24 |
+
archivePrefix={arXiv},
|
25 |
+
primaryClass={cs.CL}
|
26 |
+
}
|
27 |
+
```
|