Steelskull
commited on
Commit
•
05532f6
1
Parent(s):
2678704
Update README.md
Browse files
README.md
CHANGED
@@ -13,6 +13,9 @@ tags:
|
|
13 |
|
14 |
An attempt to make a functional goliath style merge to create a [Etheria] 55b-200k with two yi-34b-200k models.
|
15 |
|
|
|
|
|
|
|
16 |
This is a merge of both VerA and VerB of Etheria-55b (There numbers were surprisingly good), I then created a sacrificial 55B out of the most performant yi-34b-200k Model
|
17 |
and performed a Dare_ties merge and equalize the model into its current state.
|
18 |
|
|
|
13 |
|
14 |
An attempt to make a functional goliath style merge to create a [Etheria] 55b-200k with two yi-34b-200k models.
|
15 |
|
16 |
+
due to the merge it 'theoretically' should have a context of 200k but I recommend starting at 32k and moveing up,
|
17 |
+
as it is unknown (at this time) what the merge has done to the context length.
|
18 |
+
|
19 |
This is a merge of both VerA and VerB of Etheria-55b (There numbers were surprisingly good), I then created a sacrificial 55B out of the most performant yi-34b-200k Model
|
20 |
and performed a Dare_ties merge and equalize the model into its current state.
|
21 |
|