Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,76 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: cc-by-nc-nd-4.0
|
3 |
+
datasets:
|
4 |
+
- Open-Orca/SlimOrca
|
5 |
+
- ajibawa-2023/SlimOrca-ShareGPT
|
6 |
+
language:
|
7 |
+
- en
|
8 |
+
---
|
9 |
+
Original model: [SlimOrca-13B](https://huggingface.co/ajibawa-2023/SlimOrca-13B)
|
10 |
+
Model creator: [ajibawa-2023](https://huggingface.co/ajibawa-2023)
|
11 |
+
## SlimOrca-13B-exl2
|
12 |
+
|
13 |
+
[4bpw h8 (main)](https://huggingface.co/cgus/SlimOrca-13B-exl2/tree/main)
|
14 |
+
|
15 |
+
## Original model card:
|
16 |
+
|
17 |
+
**SlimOrca-13B: A General Purpose Intelligent Model**
|
18 |
+
|
19 |
+
|
20 |
+
This Model is trained on refined version of SlimOrca made available by [Open-Orca](https://huggingface.co/Open-Orca) team.
|
21 |
+
The idea was to check how this Model will perform in the absence of "system" prompt/instruction.
|
22 |
+
This Model is very good in various types of General Purpose content generation such as Q&A (including multiple choice), Articles from Summary, Sentiment Analysis,
|
23 |
+
Context & Hypothesis, Reviews, Erotic story generation etc.
|
24 |
+
It can also generate Uncensored content. Kindly be careful while generating Uncensored content as you will be responsible for what you
|
25 |
+
generate.
|
26 |
+
|
27 |
+
It is trained on 517981 set of conversations. Each set having 2 conversations. I have shared this [data](https://huggingface.co/datasets/ajibawa-2023/SlimOrca-ShareGPT).
|
28 |
+
|
29 |
+
All the credit goes to the Open-Orca team for releasing SlimOrca dataset.
|
30 |
+
|
31 |
+
|
32 |
+
**Training:**
|
33 |
+
Entire dataset was trained on Azure 4 x A100 80GB. For 3 epoch, training took almost 11 Days. DeepSpeed codebase was used for training purpose.
|
34 |
+
Entire data is trained on Llama-2 by Meta.
|
35 |
+
|
36 |
+
This is a full fine tuned model. Links for quantized models are given below.
|
37 |
+
|
38 |
+
**GPTQ GGML & AWQ**
|
39 |
+
|
40 |
+
GPTQ: TBA
|
41 |
+
|
42 |
+
GGUF: TBA
|
43 |
+
|
44 |
+
AWQ: TBA
|
45 |
+
|
46 |
+
|
47 |
+
|
48 |
+
**Example Prompt:**
|
49 |
+
```
|
50 |
+
This is a conversation with your Assistant. It is a computer program designed to help you with various tasks such as answering questions, providing recommendations, and helping with decision making. You can ask it anything you want and it will do its best to give you accurate and relevant information.
|
51 |
+
|
52 |
+
Context
|
53 |
+
You are a helpful AI assistant.
|
54 |
+
|
55 |
+
USER: <prompt>
|
56 |
+
ASSISTANT:
|
57 |
+
```
|
58 |
+
You can modify above Prompt as per your requirement. I have used ShareGPT/Vicuna format v1.1 .
|
59 |
+
|
60 |
+
|
61 |
+
I want to say special Thanks to the Open Source community for helping & guiding me to better understand the AI/Model development.
|
62 |
+
|
63 |
+
Thank you for your love & support.
|
64 |
+
|
65 |
+
|
66 |
+
**Example Output**
|
67 |
+
|
68 |
+
Example 1
|
69 |
+
|
70 |
+
![Example 1](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/hM_EJaSZiMjMQU35EiHGM.png)
|
71 |
+
|
72 |
+
Example 2
|
73 |
+
|
74 |
+
![Example 2](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/riNaxJeTWdCEE4dNP8GWp.png)
|
75 |
+
|
76 |
+
|