Update README.md
README.md CHANGED
@@ -14,13 +14,15 @@ Zamba requires you use `transformers` version 4.39.0 or higher:
 pip install transformers>=4.39.0
 ```
 
-In order to run optimized Mamba implementations, you first need to install `mamba-ssm` and `causal-conv1d`:
+In order to run optimized Mamba implementations on a CUDA device, you first need to install `mamba-ssm` and `causal-conv1d`:
 ```bash
 pip install mamba-ssm causal-conv1d>=1.2.0
 ```
-You also have to have the model on a CUDA device.
 
-You can run the model not using the optimized Mamba kernels, but it is **not** recommended as it will result in significantly higher latency.
+You can run the model not using the optimized Mamba kernels, but it is **not** recommended as it will result in significantly higher latency.
+
+To run on CPU, please specify `use_mamba_kernels=False` when loading the model using ``AutoModelForCausalLM.from_pretrained``.
+
 
 ## Inference
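The added line documents CPU fallback via `use_mamba_kernels=False`. A minimal sketch of what that load call could look like, assuming `transformers>=4.39.0`; the checkpoint id `Zyphra/Zamba-7B-v1` is a placeholder assumption, not stated in this diff:

```python
# Sketch: loading Zamba on CPU with the optimized Mamba kernels disabled.
kwargs = {
    "use_mamba_kernels": False,  # fall back to the pure-PyTorch Mamba path
}

# The actual call is left commented out because it downloads multi-GB weights;
# the repo id is an assumed placeholder -- substitute the real checkpoint name:
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained("Zyphra/Zamba-7B-v1", **kwargs)

print(kwargs["use_mamba_kernels"])  # -> False
```

Without this flag the model class will try to use the `mamba-ssm` / `causal-conv1d` kernels, which require CUDA, so the flag is what makes CPU-only inference possible (at notably higher latency, as the README warns).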