kanishka/opt-babylm2-rewritten-clean-spacy-32k-earlystop-40epochs_seed-42_3e-4

Tags: Text Generation, Generated from Trainer, text-generation-inference, Inference Endpoints



1 contributor

History: 3 commits


Latest commit: End of training (3f74250, verified), 11 days ago

File                     Size           Last commit      Updated
.gitattributes           1.52 kB        initial commit   12 days ago
README.md                3.38 kB        End of training  11 days ago
added_tokens.json        29 Bytes       Model save       11 days ago
all_results.json         486 Bytes      End of training  11 days ago
config.json              804 Bytes      Model save       11 days ago
eval_results.json        269 Bytes      End of training  11 days ago
generation_config.json   132 Bytes      Model save       11 days ago
model.safetensors        442 MB (LFS)   Model save       11 days ago
special_tokens_map.json  551 Bytes      Model save       11 days ago
tokenizer.json           771 kB         Model save       11 days ago
tokenizer_config.json    1.07 kB        Model save       11 days ago
train_results.json       238 Bytes      End of training  11 days ago
trainer_state.json       15.4 kB        End of training  11 days ago
training_args.bin        5.37 kB (LFS)  Model save       11 days ago
vocab.json               507 kB         Model save       11 days ago

training_args.bin: Detected Pickle imports (9)
- torch.device
- transformers.trainer_utils.IntervalStrategy
- transformers.trainer_utils.HubStrategy
- transformers.trainer_pt_utils.AcceleratorConfig
- accelerate.state.PartialState
- accelerate.utils.dataclasses.DistributedType
- transformers.training_args.OptimizerNames
- transformers.trainer_utils.SchedulerType
- transformers.training_args.TrainingArguments