|
--- |
|
base_model: |
|
- wzhouad/gemma-2-9b-it-WPO-HB |
|
- google/gemma-2-9b-it |
|
- princeton-nlp/gemma-2-9b-it-SimPO |
|
library_name: transformers |
|
tags: |
|
- mergekit |
|
- merge |
|
|
|
--- |
|
# Gemma Advanced V1 (obsolete) |
|
|
|
Note: A much-improved version is available at [jsgreenawalt/gemma-2-9B-it-advanced-v2.1](https://huggingface.co/jsgreenawalt/gemma-2-9B-it-advanced-v2.1) |
|
|
|
Experimental merge #1, attempting to combine some of the advanced Gemma fine-tunes |
|
|
|
Quants are available at [QuantFactory/gemma-advanced-v1-GGUF](https://huggingface.co/QuantFactory/gemma-advanced-v1-GGUF)
|
|
|
Notes and observations: |
|
* Recommended temperature: 0.15 or lower; the model is more temperature-sensitive than its parent models (see the usage sketch below this list)
|
* Recommended quant: Q8_0; Q6* and lower quants lose more quality than expected
|
* The model writes coherently (at lower temperatures) and has a different writing style than the parent models |
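
A minimal usage sketch, not part of the original card: it loads the model with Hugging Face Transformers and applies the low-temperature recommendation from the notes above. The repo id `jsgreenawalt/gemma-advanced-v1` is assumed from the naming used elsewhere on this card; adjust the repo id, dtype, and prompt to your setup.

```python
# Hypothetical usage sketch: the repo id below is assumed, not confirmed by the card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "jsgreenawalt/gemma-advanced-v1"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [{"role": "user", "content": "Write a short scene set in a lighthouse."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Keep temperature at or below 0.15, per the notes above.
output = model.generate(input_ids, max_new_tokens=256, do_sample=True, temperature=0.15)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```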
|
|
|
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). |
|
|
|
## Merge Details |
|
### Merge Method |
|
|
|
This model was merged using the DELLA merge method, with [google/gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it) as the base.
|
|
|
### Models Merged |
|
|
|
The following models were included in the merge: |
|
* [wzhouad/gemma-2-9b-it-WPO-HB](https://huggingface.co/wzhouad/gemma-2-9b-it-WPO-HB) |
|
* [princeton-nlp/gemma-2-9b-it-SimPO](https://huggingface.co/princeton-nlp/gemma-2-9b-it-SimPO) |
|
|
|
### Configuration |
|
|
|
The following YAML configuration was used to produce this model: |
|
|
|
```yaml
models:
  - model: google/gemma-2-9b-it
    # no parameters necessary for base model
  - model: princeton-nlp/gemma-2-9b-it-SimPO
    parameters:
      density: 0.5
      weight: 0.5
  - model: wzhouad/gemma-2-9b-it-WPO-HB
    parameters:
      density: 0.5
      weight: 0.5
merge_method: della
base_model: google/gemma-2-9b-it
parameters:
  normalize: true
dtype: float16
```
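
To reproduce the merge, a configuration like this can be passed to mergekit's `mergekit-yaml` entry point, e.g. `mergekit-yaml config.yaml ./gemma-advanced-v1` (the output path is illustrative); the exact command used for this model is not recorded here.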
|
|