spmurrayzzz
committed on
Commit
•
f890203
1
Parent(s):
6780f05
Update README.md
README.md
CHANGED
@@ -26,6 +26,31 @@ the training dynamics specific to large language models. The dataset used in fin
 a "syndicate" of other open language models both of similar parameter size and larger. Each model would generate a
 response for a given instruction, and the group would vote on which model's response was best.
 
+The instruction inputs used for the output label synthesis were a curated subset of [VMWare/open-instruct](https://huggingface.co/datasets/VMware/open-instruct)
+with additional instructions synthesized from scratch.
+
+## Prompt template
+
+With context
+```
+Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
+
+### Instruction:
+
+### Input:
+
+### Response:
+```
+
+Without context
+```
+Below is an instruction that describes a task. Write a response that appropriately completes the request.
+
+### Instruction:
+
+### Response:
+```
+
 ## Evaluation Results
 _12.30.23_
 | Benchmark | Result |
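The two prompt templates added in this commit could be filled in programmatically along the following lines. This is only a minimal sketch: `build_prompt`, its parameters, and the two header constants are hypothetical names that do not appear in the README, and placing the instruction and input text directly under their `###` headers is an assumption based on the blank lines shown in the template.

```python
from typing import Optional

# Preamble sentences copied from the two templates in the README diff.
WITH_CONTEXT_HEADER = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request."
)
WITHOUT_CONTEXT_HEADER = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request."
)


def build_prompt(instruction: str, context: Optional[str] = None) -> str:
    """Hypothetical helper: render one of the two README templates.

    Assumption: the instruction and the optional input go on the line
    directly below their headers, and the model's answer is expected
    after the final "### Response:" header.
    """
    if context:
        # "With context" variant: instruction plus an additional input.
        return (
            f"{WITH_CONTEXT_HEADER}\n\n"
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{context}\n\n"
            "### Response:\n"
        )
    # "Without context" variant: instruction only.
    return (
        f"{WITHOUT_CONTEXT_HEADER}\n\n"
        f"### Instruction:\n{instruction}\n\n"
        "### Response:\n"
    )
```

Calling `build_prompt("Summarize the text.", "Some text.")` selects the "With context" template, while omitting the second argument (or passing an empty string) falls back to the "Without context" one.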