Unofficial dataset
#2
by
SinanAkkoyun
- opened
Hey, is there an unofficial dataset available? Would love to train tinyllama with phi's dataset
My best guess is your never getting the dataset
Same, love a finetune for tinyllama with this data, but it looks like MS wont be releasing it.
Yeah
Nice
@kotyKD Thank you so much :) I will keep this open if others emerge
If anyone wants to see an even smaller model off this, I have a project https://github.com/VatsaDev/NanoPhi, where I'm trying to finetune GPT-2 off these unofficial datasets
gugarosa
changed discussion status to
closed