fakezeta commited on
Commit
77bf2b8
1 Parent(s): 4f9f9da

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -15,6 +15,21 @@ license_link: https://huggingface.co/meta-llama/Meta-Llama-3-8B/blob/main/LICENS
15
 
16
  # OpenVINO IR model with int8 quantization of llama-3-8B-Instruct
17
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  ## Model Details
19
 
20
  Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8 and 70B sizes. The Llama 3 instruction tuned models are optimized for dialogue use cases and outperform many of the available open source chat models on common industry benchmarks. Further, in developing these models, we took great care to optimize helpfulness and safety.
 
15
 
16
  # OpenVINO IR model with int8 quantization of llama-3-8B-Instruct
17
 
18
+ Model definition for LocalAI:
19
+ ```
20
+ name: llama3
21
+ backend: transformers
22
+ parameters:
23
+ model: fakezeta/llama-3-8b-instruct-ov-int8
24
+ context_size: 8192
25
+ type: OVModelForCausalLM
26
+ template:
27
+ use_tokenizer_template: true
28
+ stopwords:
29
+ - "<|eot_id|>"
30
+ - "<|end_of_text|>"
31
+ ```
32
+
33
  ## Model Details
34
 
35
  Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8 and 70B sizes. The Llama 3 instruction tuned models are optimized for dialogue use cases and outperform many of the available open source chat models on common industry benchmarks. Further, in developing these models, we took great care to optimize helpfulness and safety.