license: apache-2.0
Experiment: can DUF be taken one or more steps further?
8 layers were removed from both models, as in the original paper. The base version of upstage/SOLAR-10.7B-v1.0 was used for the merge.
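
As a rough illustration of the layer arithmetic, here is a minimal sketch. It assumes that "the original paper" refers to the SOLAR depth up-scaling recipe, that both models are copies of the same 48-layer base, and that the last 8 layers are dropped from the first copy and the first 8 from the second. The function names `layer_ranges` and `merged_depth` are hypothetical and are not part of any merge tool.

```python
# Sketch only: computes which layer ranges would be kept from each copy.
# Assumptions (not stated in the card): both copies share one base model,
# the first copy loses its last `removed` layers and the second copy loses
# its first `removed` layers, as in the SOLAR depth up-scaling recipe.

def layer_ranges(num_layers: int, removed: int) -> list[tuple[int, int]]:
    """Return half-open (start, end) layer ranges kept from each copy."""
    if not 0 <= removed < num_layers:
        raise ValueError("removed must be in [0, num_layers)")
    first_copy = (0, num_layers - removed)   # drop the final layers
    second_copy = (removed, num_layers)      # drop the initial layers
    return [first_copy, second_copy]


def merged_depth(ranges: list[tuple[int, int]]) -> int:
    """Total number of layers after stacking the kept ranges."""
    return sum(end - start for start, end in ranges)


if __name__ == "__main__":
    # 48 is the assumed depth of the SOLAR-10.7B base; 8 comes from the card.
    ranges = layer_ranges(num_layers=48, removed=8)
    print(ranges, merged_depth(ranges))
```

Under these assumptions the two kept ranges are layers 0 to 39 and layers 8 to 47, stacked into an 80-layer model. The actual merge configuration may differ.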