how does the function ```prepare_for_finetune``` work?

#7
by timaos - opened

I cannot see the structure of the data to fine-tune cpm-bee

When I ran the accelerator-based code, I got the losses whose value was nan.

Hi timaos, I meet the same problem and solved this by change the structure of the input data. Since the cpm-bee group changed the tokenizer class, the input data should be list of dict but not the huggingface's conventional dict of list. Just need a data_prepare.py to deal with the input data.

I have put my scripts on github: https://github.com/yingli-Claire/cpm-bee-infra-finetune

Sign up or log in to comment