Finetuning

by dimidd - opened Apr 10

Apr 10

Thanks for this model!

I'd like to finetune it on my own data and have a question:
In the Vitamin C dataset, the label is an ordinal variable which has 3 levels: "REFUTES", "NOT ENOUGH DATA", and "SUPPORTS".
However, your training code expects the label to be an int:

for i, row in df_train.iterrows():
   train_examples.append(InputExample(texts=[row['source'], row['summary']], label=int(row['label'])))

Could you please elaborate how are these levels converted to ints?

amin3141

Vectara org Apr 11

I'm not the author, but I'd imagine you convert REFUTES to 0, SUPPORTS to 1, and then you have to options for NOT ENOUGH DATA: you could either drop it, or treat it as 0.

dimidd

Apr 11

Right, thank you Amin. I just want to be consistent with the original training data.

forrest-vectara changed discussion status to closed Oct 6

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment