Update README.md
README.md
CHANGED
@@ -12,6 +12,18 @@ tags:
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

Re-injected the base model into the instruct model in the intermediate layers while keeping the input and output layers the same (sophosympatheia gradient).
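
For reference, a sophosympatheia-style gradient merge can be expressed in mergekit along the lines of the sketch below. This is only an illustration of the idea, not the configuration used for this model: the model names, layer count, and t values are placeholders, and the actual gradient may differ.

```yaml
# Illustrative sketch only: placeholder model names, layer count, and t values.
# SLERP between the instruct model (kept unchanged at the input/output ends, t = 0)
# and the base model (blended back in toward the middle of the stack).
slices:
  - sources:
      - model: org/model-instruct   # placeholder: the instruct model
        layer_range: [0, 32]
      - model: org/model-base       # placeholder: the pre-trained base model
        layer_range: [0, 32]
merge_method: slerp
base_model: org/model-instruct
parameters:
  t:
    # Interpolated across the layer range: 0 at both ends keeps the input and
    # output layers pure instruct; the mid-list peak re-injects the base model.
    - value: [0.0, 0.3, 0.5, 0.3, 0.0]
dtype: bfloat16
```

With mergekit installed, a config like this is run with `mergekit-yaml config.yml ./merged-model`.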

While this did degrade the model's overall score in EQ-Bench testing compared to the instruct model (76.9195 down to 73.8068), it removed the instruct model's issue with misspelling some of the emotion responses, and the score remains notably higher than the base model's (60.1027, though without any syntax errors). It did throw in one non-misspelled "didn't match reference" syntax error; I presume it replaced the emotion entirely or used a similar, grammatically correct one.

Looking at this as research evidence, it seems like the instruct model picked up something specifically in its intermediate layers that occasionally hurts spelling.

I don't know whether there is any other gain from this merge compared to using one or both of its components; this was done out of curiosity.
It might still be useful as more compact merge material if you wanted both the base and instruct models anyway.

## Merge Details

### Merge Method