Update README.md
README.md CHANGED
@@ -16,7 +16,7 @@ inference:
# Model Card for DictaLM-2.0-AWQ

-The DictaLM-2.0 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters
+The DictaLM-2.0 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters trained to specialize in Hebrew text.

For full details of this model please read our [release blog post](https://example.com).
@@ -56,7 +56,7 @@ print(model(prompt.strip(), do_sample=False, max_new_tokens=4, stop_sequence='\n
## Model Architecture

DictaLM-2.0 is based on the [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) model with the following changes:
-- An extended tokenizer with tokens for Hebrew, increasing the compression
+- An extended tokenizer with 1,000 injected tokens specifically for Hebrew, increasing the compression rate from 5.78 tokens/word to 2.76 tokens/word.
- Continued pretraining on over 190B tokens of naturally occurring text, 50% Hebrew and 50% English.

## Notice
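The compression figure added in the tokenizer bullet above can be checked empirically. Below is a minimal sketch (not part of the model card) that compares tokens per word on a Hebrew sample between the base Mistral tokenizer and the extended DictaLM tokenizer; the repo id `dicta-il/dictalm2.0-AWQ` and the sample sentence are illustrative assumptions.

```python
# Minimal sketch: compare tokens/word on a Hebrew sample between the base
# Mistral tokenizer and the extended DictaLM tokenizer.
# The dicta-il/dictalm2.0-AWQ repo id and the sample text are assumptions.
from transformers import AutoTokenizer

sample = "מודל שפה גדול לעברית"  # any representative Hebrew text
num_words = len(sample.split())

for repo in ("mistralai/Mistral-7B-v0.1", "dicta-il/dictalm2.0-AWQ"):
    tok = AutoTokenizer.from_pretrained(repo)
    # Fewer tokens per word means better compression of Hebrew text.
    print(f"{repo}: {len(tok.tokenize(sample)) / num_words:.2f} tokens/word")
```

The 5.78 and 2.76 tokens/word figures in the card are corpus-level averages, so a short sample like this will only approximate them.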