Llama-3.2-1B
Collection
8 items
•
Updated
•
1
This model is a fine-tuned version of unsloth/meta-llama-3.1-8b-instruct-bnb-4bit on the None dataset.
This model was trained only on the top 1 model from clembench version 0.9 and 1.0.
The dataset id is D20003 and it contains approximately 350 played episodes
The following hyperparameters were used during training: