๐ฎ๐น๐ฏ๐ต๐ง๐ท Generating multilingual instruction datasets with Magpie ๐ฆโโฌ
โข
18
@Mollel created another dataset using Glot for language detection instead of fastText.
https://huggingface.co/datasets/sartifyllc/tulu-3-sft-mixture-language-glot
Good work!