More info

by Sigmally - opened Nov 28, 2023

Discussion

Sigmally

Nov 28, 2023

Hi! Could you share more information about this model? How long has it been trained, what graphics card was it trained on, how was it trained (do you have your own code for finetuning or sth else)?

maywell

Owner Nov 28, 2023

https://huggingface.co/datasets/maywell/hh-rlhf-harmyes

1 Epoch SFT, and 1 Epoch DPO trained using trl library with this dataset

Used 1 x A100 around 3~4 hour.

Sigmally

Nov 28, 2023

thanks!

maywell changed discussion status to closed Nov 28, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment