Commit 70ef6df by TheBloke
1 parent: 76d04c4

Update README.md

Files changed (1)
  1. README.md +3 -2
README.md CHANGED
@@ -39,8 +39,9 @@ quantized_by: TheBloke
 This repo contains GGML format model files for [Stability AI's StableBeluga 2](https://huggingface.co/stabilityai/StableBeluga2).
 
 These 70B Llama 2 GGML files currently only support CPU inference. They are known to work with:
-* [llama.cpp](https://github.com/ggerganov/llama.cpp), commit `e76d630` and later.
-* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most popular web UI.
+* [llama.cpp](https://github.com/ggerganov/llama.cpp), commit `e76d630` and later. CUDA GPU acceleration is supported. No Metal acceleration for macOS yet.
+* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI. Supports CUDA GPU acceleration for 70B models.
+* [LM Studio](https://lmstudio.ai/), a fully featured local GUI with GPU acceleration. 70B GPU acceleration works on Windows, but not yet on macOS.
 * [KoboldCpp](https://github.com/LostRuins/koboldcpp), version 1.37 and later. A powerful GGML web UI, especially good for story telling.
 * [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), version 0.1.77 and later. A Python library with LangChain support, and OpenAI-compatible API server.
 