Pankaj Mathur committed
Commit ad636d6
Parent(s): 8889e72
Update README.md
README.md CHANGED
@@ -4,14 +4,20 @@ language:
- en
library_name: adapter-transformers
---
-# alpaca_orca_open_llama: An Open_LLaMA-3B model trained on Alpaca dataset using Orca Research paper approaches
+# alpaca_orca_open_llama: An Open_LLaMA-3B model trained on a custom Alpaca dataset using Orca Research Paper approaches


-# Dataset
+# Dataset

-We train OpenLLaMa-3B model
+We train the OpenLLaMA-3B model on a custom explanation-tuned Alpaca dataset (~52K samples) created using approaches from the [Orca Research Paper](https://arxiv.org/abs/2306.02707).

-
+We leverage all 15 system instructions provided in the [Orca Research Paper](https://arxiv.org/abs/2306.02707) to generate the custom Alpaca dataset, in contrast to the vanilla instruction-tuning approach used in the original [Alpaca research paper](https://crfm.stanford.edu/2023/03/13/alpaca.html).
+
+This helps the student model, aka [alpaca_orca_open_llama_3b](psmathur/alpaca_orca_open_llama_3b), learn the ***thought*** process of the teacher model, ChatGPT (gpt-3.5-turbo-0301 version).
+
+Please pay attention to how the **System** prompt is added before each *instruction* in the example usage below.
+
+# Training

The training configurations are provided in the table below.

@@ -84,4 +90,4 @@ Next Goals:
2) Try bigger OpenLLaMA models 7B and 13B
3) Try better GPU for training, couldn't get 8xA100 (40GB), I guess they are in hot demand now.
4) Provide more options for Text generation UI. (may be https://github.com/oobabooga/text-generation-webui)
-6) Provide 4bit quantized model (may be TheBloke can help here)
+6) Provide a 4-bit GGML/GPTQ quantized model (maybe [TheBloke](https://huggingface.co/TheBloke) can help here)
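To illustrate the prompt construction described in the updated README text, here is a minimal Python sketch of prepending an Orca-style **System** instruction to an Alpaca-style *instruction* before training or generation. The `### System / ### User / ### Response` template, the `build_prompt` helper, and the sample system instruction are illustrative assumptions, not code or templates taken from the repository.

```python
# Illustrative sketch only: shows the general idea of placing a System prompt
# before each instruction, as the README text describes. The template, helper
# name, and system instruction below are assumptions, not the repo's actual code.

# A paraphrased example of an Orca-style system instruction.
SYSTEM = (
    "You are an AI assistant. Provide a detailed answer and explain your "
    "reasoning step by step so a student can follow it."
)

def build_prompt(instruction: str, user_input: str = "") -> str:
    """Compose one prompt: System block, then the instruction, then a response slot."""
    prompt = f"### System:\n{SYSTEM}\n\n### User:\n{instruction}\n"
    if user_input:
        prompt += f"\n### Input:\n{user_input}\n"
    prompt += "\n### Response:\n"
    return prompt

# Example usage: the same pattern would be applied to every record in the dataset.
print(build_prompt("Explain why the sky appears blue."))
```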