sanchit-gandhi
/

distil-zephyr-1.5b-ssft

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

distil-zephyr-1.5b-ssft

1 contributor

History: 6 commits

sanchit-gandhi's picture

sanchit-gandhi HF staff

Training in progress, step 500

895829e verified 10 months ago

runs
Training in progress, step 500 10 months ago
wandb
Training in progress, step 500 10 months ago
.gitattributes

1.52 kB

initial commit 10 months ago
config.json

655 Bytes

Training in progress, step 100 10 months ago
config_full.yaml

1.38 kB

Training in progress, step 100 10 months ago
deepspeed_zero3.yaml

498 Bytes

Training in progress, step 100 10 months ago
model.safetensors

3.14 GB
LFS

Training in progress, step 500 10 months ago
run_sft.py

7.05 kB

Training in progress, step 100 10 months ago
special_tokens_map.json

437 Bytes

Training in progress, step 100 10 months ago
tokenizer.json

1.8 MB

Training in progress, step 100 10 months ago
tokenizer.model

493 kB
LFS

Training in progress, step 100 10 months ago
tokenizer_config.json

1.39 kB

Training in progress, step 100 10 months ago
training_args.bin
Detected Pickle imports (12)
- "torch.device",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.trainer_utils.SchedulerType",
- "accelerate.state.PartialState",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "torch.bfloat16",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "alignment.configs.SFTConfig",
- "transformers.trainer_utils.IntervalStrategy",
- "accelerate.utils.dataclasses.DistributedType"
How to fix it?
5.82 kB
LFS

Training in progress, step 100 10 months ago