slm-experiments
Collection
15 items
•
Updated
This model is a fine-tuned version of HuggingFaceTB/SmolLM-1.7B-Instruct on the generator dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
No log | 0.8 | 1 | 1.9677 |
No log | 1.6 | 2 | 1.9588 |
No log | 2.4 | 3 | 1.9242 |
No log | 4.0 | 5 | 1.8088 |
No log | 4.8 | 6 | 1.7755 |
No log | 5.6 | 7 | 1.7593 |
No log | 6.4 | 8 | 1.7526 |
1.8621 | 8.0 | 10 | 1.7491 |
Base model
HuggingFaceTB/SmolLM-1.7B