dnhkng
/

RYS-XLarge

@@ -104,7 +104,7 @@ This is a new kind of model optimization. It is based on a new method for the an
 ### Model improvement with layer duplication:
 |                 | Average | IFEval | BBH  | MATH Lvl 5 | GPQA | MUSR  | MMLU-PRO |
 |-----------------|--------:|-------:|-----:|-----------:|-----:|------:|---------:|
-| RYS Improvement |    2.61 |  -2.05 | 2.51 |       8.16 | 2.58 | 17.72 |     0.31 |
 This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which in turn was tuned from Qwen2-72B. As this method is orthogonal to fine-tuning, the further finetune from MaziyarPanahi now has the top position:

 ### Model improvement with layer duplication:
 |                 | Average | IFEval | BBH  | MATH Lvl 5 | GPQA | MUSR  | MMLU-PRO |
 |-----------------|--------:|-------:|-----:|-----------:|-----:|------:|---------:|
+| RYS Improvement |    2.61% |  -2.05% | 2.51% |       8.16% | 2.58% | 17.72% |     0.31% |
 This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which in turn was tuned from Qwen2-72B. As this method is orthogonal to fine-tuning, the further finetune from MaziyarPanahi now has the top position: