dnhkng
/

RYS-XLarge

@@ -103,8 +103,8 @@ This is a new kind of model optimization. It is based on a new method for the an
 ### Model improvement with layer duplication:
 |                 | Average | IFEval | BBH  | MATH Lvl 5 | GPQA | MUSR  | MMLU-PRO |
-|-----------------|---------|--------|------|------------|------|-------|----------|
-| RYS Improvement |   2.61% | -2.05% |2.51% |      8.16% |2.58% |17.72% |    0.31% |
 This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which in turn was tuned from Qwen2-72B. As this method is orthogonal to fine-tuning, the further finetune from MaziyarPanahi now has the top position:

 ### Model improvement with layer duplication:
 |                 | Average | IFEval | BBH  | MATH Lvl 5 | GPQA | MUSR  | MMLU-PRO |
+|-----------------|--------:|-------:|-----:|-----------:|-----:|------:|---------:|
+| RYS Improvement |    2.61 |  -2.05 | 2.51 |       8.16 | 2.58 | 17.72 |     0.31 |
 This model is based on MaziyarPanahi/calme-2.1-qwen2-72b, which in turn was tuned from Qwen2-72B. As this method is orthogonal to fine-tuning, the further finetune from MaziyarPanahi now has the top position: