how does the function ```prepare_for_finetune``` work?

by timaos - opened Aug 30, 2023

Discussion

timaos

Aug 30, 2023

I cannot see the structure of the data to fine-tune cpm-bee

timaos

Aug 30, 2023

When I ran the accelerator-based code, I got the losses whose value was nan.

yingli-Claire

Oct 23

Hi timaos, I meet the same problem and solved this by change the structure of the input data. Since the cpm-bee group changed the tokenizer class, the input data should be list of dict but not the huggingface's conventional dict of list. Just need a data_prepare.py to deal with the input data.

I have put my scripts on github: https://github.com/yingli-Claire/cpm-bee-infra-finetune

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment