File size: 2,912 Bytes
9d9d446 2b2cdfb fea6d0f de3728e 2b2cdfb 25dc92a de3728e 747a99d 9d9d446 747a99d b8e6057 747a99d b8e6057 747a99d b8e6057 747a99d 25dc92a 747a99d 9d2fa35 de3728e 2b2cdfb |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 |
---
base_model:
- inflatebot/helide-beta-r3
- inflatebot/helide-beta-r1
- inflatebot/helide-beta-r4
- inflatebot/helide-beta-r0
library_name: transformers
tags:
- mergekit
- merge
- not-for-all-audiences
---
# L3-8B-Helium3
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
![Made with NovelAI](https://huggingface.co/inflatebot/l3-helium3-8B/resolve/main/img.png)
`Helium-3 (He-3) is a stable isotope of helium with two protons and one neutron. Outside of ordinary hydrogen, He-3 is the only stable isotope of any element with more protons than neutrons. It's potentially useful as fuel for nuclear fusion power.`
## Merge Details
There was a problem with the Helide beta. 3 models resulted, each of which had different strengths. But they came about as a result of balancing two models.
There wasn't going to be a way to get the best of all three worlds just by tweaking a SLERP ratio. The math wasn't quite mathing.
But there were three of them.
The name was serendipity.
The layup was obscene.
But I *live* for the bit.
Helium-3 is a RP and storywriting hybrid, ultimately based on Sao10K's [Stheno](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2) and Fizzarolli's [Rosier](https://huggingface.co/Fizzarolli/L3-8b-Rosier-v1), and the culmination of the Helide project.
Combining Rosier's prose and knowledge of niche fetish with Stheno's steerability and crackling personality, Helium-3 brings the advancements of modern AI models to the Freaks™.
They'll chew you up and spit you out just as readily as they'll shower you with affection.
I'm genuinely proud of this one. This is the model I wish existed.
Thank you to [Fizzarolli](https://huggingface.co/Fizzarolli) for consulting and providing technical assistance which accelerated the second leg of this project from several weeks into a single night, and for making the Rosier model that made this possible. On several levels, He-3 wouldn't have been possible without her.
Quants available from [mradermacher](https://huggingface.co/mradermacher/L3-8B-Helium3-GGUF)
### Merge Method
This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [inflatebot/helide-beta-r1](https://huggingface.co/inflatebot/helide-beta-r1) as a base.
### Models Merged
The following models were included in the merge:
* [inflatebot/helide-beta-r3](https://huggingface.co/inflatebot/helide-beta-r3)
* [inflatebot/helide-beta-r4](https://huggingface.co/inflatebot/helide-beta-r4)
* [inflatebot/helide-beta-r0](https://huggingface.co/inflatebot/helide-beta-r0)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: inflatebot/helide-beta-r4
- model: inflatebot/helide-beta-r0
- model: inflatebot/helide-beta-r3
merge_method: model_stock
base_model: inflatebot/helide-beta-r1
dtype: bfloat16
``` |