Model Card for Model ID
This a model is a chat model fine-tuned with RLHF using DeepSpeed Chat and LoRA. It is based on OPT1.3B.
Model Details
Model Description
- Developed by: The Kaitchup
- Model type: Causal
- Language(s) (NLP): English
- License: cc-by-nc-sa-4.0
- Finetuned from model: facebook/opt-1.3b
Model Sources
The model has been trained with the procedure described in this article:
- Downloads last month
- 18
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.