
Model Card for kaitchup/OPT-1.3B-SFT-DSChatLoRA

This model was fine-tuned with supervised fine-tuning (SFT) using DeepSpeed Chat. It is based on OPT-1.3B.

Model Details

Model Description

Model Sources

The model has been trained with the procedure described in this article:

Train Instruct LLMs On Your GPU with DeepSpeed Chat — Step #1: Supervised Fine-tuning
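As a rough illustration of how a checkpoint produced this way might be used, here is a minimal sketch with the Hugging Face `transformers` library. It rests on several assumptions not stated on this page: that the repository holds full merged weights together with a tokenizer (rather than a separate LoRA adapter), and that the model expects the "Human: ... Assistant:" prompt convention used by DeepSpeed Chat. The helper names `build_prompt` and `generate_reply` are hypothetical and exist only for this example.

```python
MODEL_ID = "kaitchup/OPT-1.3B-SFT-DSChatLoRA"  # repository name as shown on this page


def build_prompt(user_message: str) -> str:
    """Wrap a message in the ASSUMED DeepSpeed Chat style "Human:/Assistant:" template."""
    return f"Human: {user_message.strip()}\nAssistant:"


def generate_reply(user_message: str, max_new_tokens: int = 128) -> str:
    """Load the model and greedily generate a reply (downloads the weights on first call)."""
    # Heavy imports are kept local so build_prompt can be used without them.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    use_cuda = torch.cuda.is_available()
    device = "cuda" if use_cuda else "cpu"
    # The checkpoint is stored in FP16; fall back to FP32 on CPU for broader operator support.
    dtype = torch.float16 if use_cuda else torch.float32

    # Assumption: the repository ships a tokenizer and full (merged) model weights.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=dtype)
    model.to(device)
    model.eval()

    inputs = tokenizer(build_prompt(user_message), return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `generate_reply("What is supervised fine-tuning?")` would download the checkpoint and return the generated text. If the prompt template assumed here differs from the one used during training, output quality will likely suffer, so it is worth checking against the linked article.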

Model size: 1.32B params (Safetensors)
Tensor type: FP16
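The parameter count and tensor type above give a quick lower bound on the memory needed just to hold the weights. The sketch below works through that arithmetic; the function name `weight_memory_bytes` is hypothetical, and the estimate deliberately ignores activations, the KV cache, and framework overhead, so real usage will be higher.

```python
N_PARAMS = 1.32e9        # parameter count reported on this page
BYTES_PER_PARAM = 2      # FP16 stores each parameter in 2 bytes


def weight_memory_bytes(n_params: float, bytes_per_param: int) -> float:
    """Memory needed for the weights alone (no activations or KV cache)."""
    return n_params * bytes_per_param


total_bytes = weight_memory_bytes(N_PARAMS, BYTES_PER_PARAM)
weights_gb = total_bytes / 1e9       # decimal gigabytes
weights_gib = total_bytes / 2**30    # binary gibibytes

print(f"Weights only: {weights_gb:.2f} GB ({weights_gib:.2f} GiB)")
```

By this estimate the FP16 weights occupy about 2.64 GB, which should fit on most consumer GPUs with room left for inference.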

Datasets used to train kaitchup/OPT-1.3B-SFT-DSChatLoRA