Create README.md
README.md
ADDED
@@ -0,0 +1,17 @@
---
language:
- en
library_name: transformers
---
Converted with https://github.com/qwopqwop200/GPTQ-for-LLaMa.
All models were tested on an A100-80G.

Installation instructions, as given in the repo above:
1. Install Anaconda and create an environment with Python 3.8.
2. Install PyTorch (tested with torch-1.13-cu116).
3. Install the Transformers library. You need a recent version that includes this PR: https://github.com/huggingface/transformers/pull/21955
4. Install sentencepiece with pip.
5. Run `python setup_cuda.py install` (the CUDA kernel build script in the GPTQ-for-LLaMa repo) inside the environment.
6. Either convert the LLaMA models yourself by following the instructions in the GPTQ-for-LLaMa repo,
7. or use these weights directly by downloading the individual files as described in https://huggingface.co/docs/huggingface_hub/guides/download
8. Profit!
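
Before converting or loading anything, it can help to confirm that the packages from the installation steps are actually present in the active environment. The following is a minimal sketch, not part of the GPTQ-for-LLaMa repo: the helper name `check_environment` is hypothetical, and it only reports installed versions without verifying the CUDA kernel build.

```python
import sys
from importlib import metadata

# Packages named in the installation steps (pip distribution names).
REQUIRED = ("torch", "transformers", "sentencepiece")


def check_environment(packages=REQUIRED):
    """Map each package name to its installed version, or None if it is missing."""
    # Record the interpreter version as "major.minor.micro".
    report = {"python": ".".join(map(str, sys.version_info[:3]))}
    for name in packages:
        try:
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = None
    return report


if __name__ == "__main__":
    for name, version in check_environment().items():
        print(f"{name}: {version or 'NOT INSTALLED'}")
```

Any entry reported as `NOT INSTALLED` points back to the corresponding installation step.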
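
Downloading the weight files one by one, as the last installation option suggests, could look roughly like the sketch below. It uses `list_repo_files` and `hf_hub_download` from `huggingface_hub`. The helper names, the placeholder repo id, and the assumption that the weights end in `.pt` or `.safetensors` are illustrative and are not taken from this repository.

```python
def select_weight_files(filenames, suffixes=(".pt", ".safetensors")):
    """Keep only the filenames that end in one of the given suffixes, sorted."""
    return sorted(f for f in filenames if f.endswith(tuple(suffixes)))


def download_weights(repo_id):
    """Download every matching file into the local Hub cache and return the paths."""
    # Imported here so select_weight_files works without huggingface_hub installed.
    from huggingface_hub import hf_hub_download, list_repo_files

    paths = []
    for filename in select_weight_files(list_repo_files(repo_id)):
        # Each call fetches one file and returns its path in the local cache.
        paths.append(hf_hub_download(repo_id=repo_id, filename=filename))
    return paths


# Example call (placeholder repo id, replace it with the id of this model repo):
# paths = download_weights("your-username/your-gptq-llama-repo")
```

Filtering by suffix keeps the download limited to the checkpoint files; adjust `suffixes` if the repository stores its weights under a different extension.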