Unofficial dataset

#2
by SinanAkkoyun - opened

Hey, is there an unofficial dataset available? Would love to train tinyllama with phi's dataset

My best guess is your never getting the dataset

Same, love a finetune for tinyllama with this data, but it looks like MS wont be releasing it.

@kotyKD Thank you so much :) I will keep this open if others emerge

@VatsaDev Thank you! 😍

If anyone wants to see an even smaller model off this, I have a project https://github.com/VatsaDev/NanoPhi, where I'm trying to finetune GPT-2 off these unofficial datasets

gugarosa changed discussion status to closed

Sign up or log in to comment