Training Data
#1 opened by dennisc1
Hi Jorge,
I've been waiting for a Dutch Mixtral finetune, very cool! Could you say how you trained the model? Which data did you use?
It's been trained on:
https://huggingface.co/datasets/Rijgersberg/no_robots_nl
https://huggingface.co/datasets/Rijgersberg/ultrachat_10k_nl
https://huggingface.co/datasets/BramVanroy/dutch_chat_datasets
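These are all chat-style datasets, so each example has to be rendered into Mixtral's `[INST] ... [/INST]` instruction template before training. As a minimal sketch (this is not the author's actual training code, and the helper name and example messages are made up for illustration):

```python
def format_mixtral(messages):
    """Render a list of {'role', 'content'} chat turns into Mixtral's
    instruction format: <s>[INST] user [/INST] assistant</s>.
    Hypothetical helper for illustration, not the actual pipeline."""
    out = "<s>"
    for msg in messages:
        if msg["role"] == "user":
            out += f"[INST] {msg['content']} [/INST]"
        elif msg["role"] == "assistant":
            # Assistant turns are followed by the end-of-sequence token.
            out += f" {msg['content']}</s>"
    return out

# Example Dutch chat turn (invented for illustration)
example = [
    {"role": "user", "content": "Wat is de hoofdstad van Nederland?"},
    {"role": "assistant", "content": "Amsterdam."},
]
print(format_mixtral(example))
```

In practice you would map a function like this over the datasets above (their exact column names may differ per dataset, so check each one).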
It was trained for about 3 epochs on a single H100, but after evaluation on hellaswag_nl it dropped in performance.
So use with caution!
I will try again at a later moment, after I've read the Mixtral paper.