This model is a clone of [**ibm-granite/granite-7b-instruct**](https://huggingface.co/ibm-granite/granite-7b-instruct) compressed using ZipNN. Compressed losslessly to 67% of its original size, ZipNN saved ~5GB in storage and potentially ~30TB in data transfer **monthly**.
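As a rough sanity check of the numbers above (the checkpoint size is an assumption, not stated in this card): a 7B-parameter fp16/bf16 checkpoint weighs about 7e9 params × 2 bytes ≈ 14 GB, so compressing it to 67% of that size saves roughly a third:

```python
# Back-of-the-envelope check of the quoted savings.
# Assumption: ~14 GB original checkpoint (7B params at 2 bytes each).
params = 7e9
bytes_per_param = 2
original_gb = params * bytes_per_param / 1e9  # ~14 GB

compressed_ratio = 0.67  # ZipNN output is 67% of the original size
saved_gb = original_gb * (1 - compressed_ratio)
print(f"saved ≈ {saved_gb:.1f} GB")  # → saved ≈ 4.6 GB
```

That ~4.6 GB is consistent with the "~5GB in storage" claim above.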

### Requirement

To use this model, ZipNN must be installed first:
```bash
pip install zipnn
```

### Use This Model

```python
# Use a pipeline as a high-level helper
from transformers import pipeline
from zipnn import zipnn_hf

# Patch Hugging Face model loading so ZipNN-compressed weights
# are decompressed transparently
zipnn_hf()

messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe = pipeline("text-generation", model="royleibov/granite-7b-instruct-ZipNN-Compressed")
pipe(messages)
```
```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
from zipnn import zipnn_hf

zipnn_hf()

tokenizer = AutoTokenizer.from_pretrained("royleibov/granite-7b-instruct-ZipNN-Compressed")
model = AutoModelForCausalLM.from_pretrained("royleibov/granite-7b-instruct-ZipNN-Compressed")
```

Then continue as usual: the patch takes care of decompressing the model correctly and safely.
# Model Card for Granite-7b-lab [Paper](https://arxiv.org/abs/2403.01081)