Create README.md
README.md
CHANGED
@@ -0,0 +1,48 @@
---
language:
- en
pipeline_tag: text-generation
datasets:
- nlpai-lab/databricks-dolly-15k-ko
- kyujinpy/KOR-OpenOrca-Platypus-v3
- KETI-AIR/kor_boolq
- heegyu/open-korean-instructions
license: cc-by-sa-4.0
---

**Input** The model accepts text input only.

**Output** The model generates text only.

**Base Model** [upstage/SOLAR-10.7B-Instruct-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0)

**Training Dataset**
- [nlpai-lab/databricks-dolly-15k-ko](https://huggingface.co/datasets/nlpai-lab/databricks-dolly-15k-ko)
- [kyujinpy/KOR-OpenOrca-Platypus-v3](https://huggingface.co/datasets/kyujinpy/KOR-OpenOrca-Platypus-v3)
- [heegyu/open-korean-instructions](https://huggingface.co/datasets/heegyu/open-korean-instructions)
- [KETI-AIR/kor_boolq](https://huggingface.co/datasets/KETI-AIR/kor_boolq)
- [AIhub English-Korean translation data (subset)](https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71593)

# Implementation Code
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "ifuseok/sft-solar-10.7b-v1.1"

# Load the weights in half precision; device_map='auto' places them on the
# available devices (this option requires the accelerate package).
OpenOrca = AutoModelForCausalLM.from_pretrained(
    repo,
    return_dict=True,
    torch_dtype=torch.float16,
    device_map='auto'
)
# Load the tokenizer from the same repository.
OpenOrca_tokenizer = AutoTokenizer.from_pretrained(repo)
```

# Prompt Example
```
### System:
This is the system message.
### User:
This is the user message.
### Assistant
This is the assistant response.
```
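
The prompt layout above can be assembled programmatically. The following is a minimal sketch, not the author's own inference code: the helper names `build_prompt` and `generate_reply`, the default `max_new_tokens` value, and the choice to end the prompt with a newline right after `### Assistant` are all assumptions made for illustration. The section markers themselves are copied from the example as written, including the missing colon after `### Assistant`.

```python
def build_prompt(system: str, user: str) -> str:
    """Format one system/user turn in the layout shown in the prompt example.

    Hypothetical helper: the single-newline separators and the trailing
    newline after "### Assistant" are assumptions, not documented behavior.
    """
    return (
        "### System:\n"
        f"{system}\n"
        "### User:\n"
        f"{user}\n"
        "### Assistant\n"
    )


def generate_reply(model, tokenizer, prompt: str, max_new_tokens: int = 256) -> str:
    """Generate a continuation for `prompt` and return only the new text.

    `model` and `tokenizer` are expected to be the objects loaded in the
    implementation code; max_new_tokens=256 is an arbitrary illustrative default.
    """
    # Tokenize and move the tensors to the device that holds the model.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so that only the generated reply is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

With the model and tokenizer loaded as in the implementation code, a call might look like `generate_reply(OpenOrca, OpenOrca_tokenizer, build_prompt("You are a helpful assistant.", "Hello"))`. Sampling settings are left at the library defaults here and may need tuning.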