dfurman committed
Commit a3a1367
Parent: a81db74

Update README.md

Files changed (1): README.md (+5, -5)
README.md CHANGED
@@ -19,19 +19,19 @@ base_model: meta-llama/Llama-2-13b-hf
</div>


- # Mistral-7B-Instruct-v0.2
+ # Llama-2-13B-Instruct-v0.2

A pretrained generative language model with 7 billion parameters geared towards instruction-following capabilities.

## Model Details

- This model was built via parameter-efficient finetuning of the [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) base model on the first 20k rows in each of the [jondurbin/airoboros-2.2.1](https://huggingface.co/datasets/jondurbin/airoboros-2.2.1), [Open-Orca/SlimOrca](https://huggingface.co/datasets/Open-Orca/SlimOrca), and [garage-bAInd/Open-Platypus](https://huggingface.co/datasets/garage-bAInd/Open-Platypus) datasets.
+ This model was built via parameter-efficient finetuning of the [meta-llama/Llama-2-13b-hf](https://huggingface.co/meta-llama/Llama-2-13b-hf) base model on the first 20k rows in each of the [jondurbin/airoboros-2.2.1](https://huggingface.co/datasets/jondurbin/airoboros-2.2.1), [Open-Orca/SlimOrca](https://huggingface.co/datasets/Open-Orca/SlimOrca), and [garage-bAInd/Open-Platypus](https://huggingface.co/datasets/garage-bAInd/Open-Platypus) datasets.

- **Developed by:** Daniel Furman
- **Model type:** Decoder-only
- **Language(s) (NLP):** English
- **License:** Apache 2.0
- - **Finetuned from model:** [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)
+ - **Finetuned from model:** [meta-llama/Llama-2-13b-hf](https://huggingface.co/meta-llama/Llama-2-13b-hf)

## Model Sources

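The card paragraph updated above says the adapter was finetuned on the first 20k rows of three datasets. As a hedged sketch, not part of this commit, those subsets could be pulled with the `datasets` library roughly as follows; the `train[:20000]` slice syntax is an assumption beyond the dataset ids named in the card.

```python
# Hypothetical illustration, not code from the commit: fetch the first 20k
# rows of each finetuning dataset named in the model card.
from datasets import load_dataset

dataset_ids = [
    "jondurbin/airoboros-2.2.1",
    "Open-Orca/SlimOrca",
    "garage-bAInd/Open-Platypus",
]

# `train[:20000]` uses the datasets slicing syntax to take the first 20k rows.
subsets = {name: load_dataset(name, split="train[:20000]") for name in dataset_ids}

for name, subset in subsets.items():
    print(f"{name}: {len(subset)} rows")
```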
@@ -69,7 +69,7 @@ from transformers import (
```

```python
- peft_model_id = "dfurman/Mistral-7B-Instruct-v0.2"
+ peft_model_id = "dfurman/Llama-2-13B-Instruct-v0.2"
config = PeftConfig.from_pretrained(peft_model_id)

tokenizer = AutoTokenizer.from_pretrained(
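The hunk above only swaps the adapter id inside a snippet that the diff context truncates at the tokenizer call. For orientation, here is a minimal, hedged sketch of how a PEFT adapter like this one is typically loaded; the dtype and device settings are assumptions, not the card's exact code.

```python
# Hedged sketch of loading the adapter; precision/device settings are assumptions.
import torch
from peft import PeftConfig, PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

peft_model_id = "dfurman/Llama-2-13B-Instruct-v0.2"
config = PeftConfig.from_pretrained(peft_model_id)

# The tokenizer comes from the base model recorded in the adapter config.
tokenizer = AutoTokenizer.from_pretrained(config.base_model_name_or_path)

# Load the base weights, then attach the LoRA adapter on top.
base_model = AutoModelForCausalLM.from_pretrained(
    config.base_model_name_or_path,
    torch_dtype=torch.bfloat16,  # assumed precision
    device_map="auto",
)
model = PeftModel.from_pretrained(base_model, peft_model_id)
model.eval()
```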
@@ -166,7 +166,7 @@ Ice cubes to fill the shaker

## Training

- It took ~5 hours to train 3 epochs on 1x A100 (40 GB SXM).
+ It took ~7 hours to train 3 epochs on 1x A100 (40 GB SXM).

### Prompt Format

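The training note updated above reports ~7 hours for 3 epochs on a single A100 (40 GB). The commit does not include the training script; the sketch below only illustrates what a LoRA setup consistent with that description could look like, and every hyperparameter in it is an assumption.

```python
# Hypothetical LoRA configuration; all hyperparameters below are assumptions.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, TrainingArguments

base_model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-13b-hf",
    torch_dtype=torch.bfloat16,  # assumed precision
)

lora_config = LoraConfig(
    r=16,                                  # adapter rank (assumed)
    lora_alpha=32,                         # scaling factor (assumed)
    target_modules=["q_proj", "v_proj"],   # attention projections (assumed)
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()

training_args = TrainingArguments(
    output_dir="./llama-2-13b-instruct-v0.2",
    num_train_epochs=3,                    # matches the card's "3 epochs"
    per_device_train_batch_size=1,         # assumed, sized for a 40 GB A100
    gradient_accumulation_steps=16,        # assumed
    learning_rate=2e-4,                    # assumed
    bf16=True,
)
```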
 