Tags: Text Generation · Safetensors · qwen2 · chat · conversational · Eval Results · 4-bit precision
luigi86 committed
Commit: d9f97ff
1 Parent(s): e98fa4c

Update README.md

Files changed (1)
  1. README.md +0 -8
README.md CHANGED
@@ -121,14 +121,6 @@ model-index:
 
  Quantized to 4 bpw precision and tested using the `mlx_lm` utility on a 64GiB URAM M1 Max.
 
- ## Notes on using:
-
- Requires and optimized for Apple Silicon. Fast enough for rapid back-and-forth as long as it fits on your URAM.
-
- I tried to serve this with `mlx_lm.serve` per usual, but I got python string indexing errors no matter what I did. It works fine with LM Studio in OpenAI mode.
-
- I used this with SillyTavern, it worked well.
-
  See [original model](https://huggingface.co/anthracite-org/magnum-v2-72b) for further details.
 
  Larger, 8bpw quants available at [mlx-community](https://huggingface.co/mlx-community/magnum-v2-72b).
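For context, the README this commit edits describes an MLX 4-bit quant run with the `mlx_lm` utility. A minimal sketch of trying such a quant locally might look like the following (assumptions: the `mlx-lm` pip package and its `mlx_lm.generate` CLI; `<path-to-this-quant>` is a hypothetical placeholder for this repo's model ID; Apple Silicon with enough unified memory is required):

```shell
# Install the MLX LM tooling (Apple Silicon only).
pip install mlx-lm

# Sample from the quantized model; --max-tokens caps the response length.
# Replace <path-to-this-quant> with the actual Hugging Face repo ID or local path.
python -m mlx_lm.generate \
  --model <path-to-this-quant> \
  --prompt "Hello, how are you?" \
  --max-tokens 100
```

Since the removed notes report that `mlx_lm.serve` failed for this quant, an OpenAI-compatible front end such as LM Studio may be the more reliable way to serve it.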