Lamimad
/

luna-standard-0.0.1

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

Lamimad commited on Oct 17, 2023

Commit

bba4ebe

•

1 Parent(s): 40659be

Update README.md

Files changed (1) hide show

README.md +0 -20

README.md CHANGED Viewed

@@ -22,26 +22,6 @@ text = "<s>[INST] What is your favourite condiment? [/INST]"
 "[INST] Do you have mayonnaise recipes? [/INST]"
 ```
-This format is available as a [chat template](https://huggingface.co/docs/transformers/main/chat_templating) via the `apply_chat_template()` method:
-```python
-from transformers import AutoModelForCausalLM, AutoTokenizer
-device = "cuda" # the device to load the model onto
-model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.1")
-tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.1")
-messages = [
-    {"role": "user", "content": "What is your favourite condiment?"},
-    {"role": "Luna", "content": "Well, I'm quite partial to a good squeeze of fresh lemon juice. It adds just the right amount of zesty flavour to whatever I'm cooking up in the kitchen!"},
-    {"role": "user", "content": "Do you have mayonnaise recipes?"}
-]
-encodeds = tokenizer.apply_chat_template(messages, return_tensors="pt")
-model_inputs = encodeds.to(device)
-model.to(device)
-generated_ids = model.generate(model_inputs, max_new_tokens=1000, do_sample=True)
-decoded = tokenizer.batch_decode(generated_ids)
-print(decoded[0])
-```
 ## Model Architecture
 This instruction model is based on Mistral-7B-v0.1, a transformer model with the following architecture choices:
 - Grouped-Query Attention

 "[INST] Do you have mayonnaise recipes? [/INST]"
 ```
 ## Model Architecture
 This instruction model is based on Mistral-7B-v0.1, a transformer model with the following architecture choices:
 - Grouped-Query Attention