Update README.md
README.md
@@ -39,8 +39,9 @@ quantized_by: TheBloke
 This repo contains GGML format model files for [Stability AI's StableBeluga 2](https://huggingface.co/stabilityai/StableBeluga2).
 
 These 70B Llama 2 GGML files currently only support CPU inference. They are known to work with:
-* [llama.cpp](https://github.com/ggerganov/llama.cpp), commit `e76d630` and later.
-* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI.
+* [llama.cpp](https://github.com/ggerganov/llama.cpp), commit `e76d630` and later. CUDA GPU acceleration is supported. No Metal acceleration for macOS yet.
+* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI. Supports CUDA GPU acceleration for 70B models.
+* [LM Studio](https://lmstudio.ai/), a fully featured local GUI with GPU acceleration. 70B GPU acceleration works on Windows, but not yet on macOS.
 * [KoboldCpp](https://github.com/LostRuins/koboldcpp), version 1.37 and later. A powerful GGML web UI, especially good for story telling.
 * [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), version 0.1.77 and later. A Python library with LangChain support, and OpenAI-compatible API server.
 
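The last bullet mentions that llama-cpp-python ships an OpenAI-compatible API server. A minimal stdlib-only sketch of building a request for that server is below; the model filename, port, and the Stable Beluga prompt template are assumptions to adjust for your setup.

```python
import json

# Sketch of querying llama-cpp-python's OpenAI-compatible server using only
# the standard library. The model path, port, and the "### User:" prompt
# template below are assumptions; adjust them for your setup. Start the
# server first, e.g.:
#   python -m llama_cpp.server --model stablebeluga2.ggmlv3.q4_K_M.bin

def build_completion_request(user_message, host="http://localhost:8000"):
    """Return (url, headers, body) for a /v1/completions request."""
    prompt = (
        "### System:\nYou are a helpful assistant.\n\n"
        f"### User:\n{user_message}\n\n### Assistant:\n"
    )
    payload = {"prompt": prompt, "max_tokens": 128, "temperature": 0.7}
    return (
        f"{host}/v1/completions",
        {"Content-Type": "application/json"},
        json.dumps(payload).encode("utf-8"),
    )

url, headers, body = build_completion_request("Write a haiku about llamas.")
# To actually send it (requires the server to be running):
# import urllib.request
# req = urllib.request.Request(url, data=body, headers=headers)
# with urllib.request.urlopen(req) as resp:
#     print(json.loads(resp.read())["choices"][0]["text"])
```

Because the endpoint follows the OpenAI wire format, the same request shape also works with the official OpenAI client libraries pointed at the local base URL.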