Update README.md
Browse files
README.md
CHANGED
@@ -33,6 +33,11 @@ SHORTFORM_TO_FULL_TASK_TYPES = {
|
|
33 |
|
34 |
Bonito AWQ Usage Notebook [AWQ_Inference-Bonito.ipynb](https://huggingface.co/mychen76/Llama-3.1-8B-bonito-v1-awq/blob/main/AWQ_Inference-Bonito.ipynb)
|
35 |
|
|
|
|
|
|
|
|
|
|
|
36 |
### Using Quantized Bonito
|
37 |
|
38 |
```
|
|
|
33 |
|
34 |
Bonito AWQ Usage Notebook [AWQ_Inference-Bonito.ipynb](https://huggingface.co/mychen76/Llama-3.1-8B-bonito-v1-awq/blob/main/AWQ_Inference-Bonito.ipynb)
|
35 |
|
36 |
+
include custom class:
|
37 |
+
- AWQBonito (inference based on AutoAWQForCausalLM)
|
38 |
+
- VLLMBonito (inference based on vLLM)
|
39 |
+
|
40 |
+
|
41 |
### Using Quantized Bonito
|
42 |
|
43 |
```
|