ifuseok commited on
Commit
023bb35
β€’
1 Parent(s): 914f3c7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -0
README.md CHANGED
@@ -1,3 +1,48 @@
1
  ---
 
 
 
 
 
 
 
 
2
  license: cc-by-nc-sa-4.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ pipeline_tag: text-generation
5
+ datasets:
6
+ - nlpai-lab/databricks-dolly-15k-ko
7
+ - kyujinpy/KOR-OpenOrca-Platypus-v3
8
+ - KETI-AIR/kor_boolq
9
+ - heegyu/open-korean-instructions
10
  license: cc-by-nc-sa-4.0
11
  ---
12
+
13
+ **Input** Models input text only.
14
+
15
+ **Output** Models generate text only.
16
+
17
+ **Base Model** [yanolja/KoSOLAR-10.7B-v0.1](https://huggingface.co/yanolja/KoSOLAR-10.7B-v0.1-deprecated)
18
+
19
+ **Training Dataset**
20
+ - [nlpai-lab/databricks-dolly-15k-ko](https://huggingface.co/datasets/nlpai-lab/databricks-dolly-15k-ko)
21
+ - [kyujinpy/KOR-OpenOrca-Platypus-v3](https://huggingface.co/datasets/kyujinpy/KOR-OpenOrca-Platypus-v3)
22
+ - [heegyu/open-korean-instructions](heegyu/open-korean-instructions)
23
+ - [KETI-AIR/kor_boolq](https://huggingface.co/datasets/KETI-AIR/kor_boolq)
24
+ - [AIhub μ˜ν•œ λ²ˆμ—­ 데이터 일뢀](https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71593)
25
+
26
+ # Implementation Code
27
+ ```python
28
+ from transformers import AutoModelForCausalLM, AutoTokenizer
29
+ import torch
30
+ repo = "ifuseok/sft-solar-10.7b-v2.1-dpo"
31
+ OpenOrca = AutoModelForCausalLM.from_pretrained(
32
+ repo,
33
+ return_dict=True,
34
+ torch_dtype=torch.float16,
35
+ device_map='auto'
36
+ )
37
+ OpenOrca_tokenizer = AutoTokenizer.from_pretrained(repo)
38
+ ```
39
+
40
+ # Prompt Example
41
+ ```
42
+ ### System:
43
+ μ‹œμŠ€ν…œ λ©”μ‹œμ§€ μž…λ‹ˆλ‹€.
44
+ ### User:
45
+ μœ μ € μž…λ‹ˆλ‹€.
46
+ ### Assistant
47
+ μ–΄μ‹œμŠ€ν„΄νŠΈ μž…λ‹ˆλ‹€.
48
+ ```