chargoddard
commited on
Commit
•
4c00383
1
Parent(s):
ef3f19c
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- Open-Orca/SlimOrca
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
---
|
8 |
+
|
9 |
+
Full weight fine tuned on two epochs of [SlimOrca](https://huggingface.co/datasets/Open-Orca/SlimOrca). Uses Mistral Instruct's prompt format.
|
10 |
+
|
11 |
+
The base model for this came from a variation on Undi's [Mistral 11B recipe](https://huggingface.co/Undi95/Mistral-11B-v0.1). The `o_proj` and `down_proj` tensors were set to zero in the added layers, making the output exactly identical to Mistral 7B before training.
|
12 |
+
|
13 |
+
Benchmarks look good locally but still evaluating actual usefulness.
|