base_model:
  - inflatebot/helide-beta-r3
  - inflatebot/helide-beta-r1
  - inflatebot/helide-beta-r4
  - inflatebot/helide-beta-r0
library_name: transformers
tags:
  - mergekit
  - merge
  - not-for-all-audiences

L3-8B-Helium3

This is a merge of pre-trained language models created using mergekit.

Made with NovelAI

Helium-3 (He-3) is a stable isotope of helium with two protons and one neutron. Outside of ordinary hydrogen, He-3 is the only stable isotope of any element with more protons than neutrons. It's potentially useful as fuel for nuclear fusion power.

Merge Details

There was a problem with the Helide beta. Three models resulted, each of which had different strengths. But they all came about as a result of balancing two models. There wasn't going to be a way to get the best of all three worlds just by tweaking a SLERP ratio. The math wasn't quite mathing.

But there were three of them.

The name was serendipity.

The layup was obscene.

But I live for the bit.

Helium-3 is an RP and storywriting hybrid, ultimately based on Sao10K's Stheno and Fizzarolli's Rosier, and the culmination of the Helide project. Combining Rosier's prose and knowledge of niche fetishes with Stheno's steerability and crackling personality, Helium-3 brings the advancements of modern AI models to the Freaks™. They'll chew you up and spit you out just as readily as they'll shower you with affection.

I'm genuinely proud of this one. This is the model I wish existed.

Thank you to Fizzarolli for consulting and providing technical assistance, which cut the second leg of this project from several weeks down to a single night, and for making the Rosier model that made this possible. On several levels, He-3 wouldn't have been possible without her.

Quants available from mradermacher

Merge Method

This model was merged using the Model Stock merge method, with inflatebot/helide-beta-r1 as the base.
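For the curious, here is a rough sketch of what Model Stock does to a single weight tensor, following the interpolation formula from the Model Stock paper: take the task vectors (fine-tuned minus base), measure how well they agree via their average pairwise cosine, and use that to decide how far to move from the base toward the plain average. This is an illustration in pure Python, not mergekit's implementation, which operates on torch tensors and may differ in details. The names `dot` and `model_stock_merge` are hypothetical, and the toy vectors in the usage note are made up.

```python
import math


def dot(a, b):
    """Dot product of two equal-length lists of floats."""
    return sum(x * y for x, y in zip(a, b))


def model_stock_merge(base, finetuned):
    """Merge one flattened weight tensor, Model Stock style (simplified sketch).

    base      -- weights of the base model (list of floats)
    finetuned -- list of k weight lists, one per fine-tuned model
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("need at least two fine-tuned models")

    # Task vectors: how far each fine-tuned model moved away from the base.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    # Average pairwise cosine similarity between the task vectors.
    cosines = []
    for i in range(k):
        for j in range(i + 1, k):
            norm = math.sqrt(dot(deltas[i], deltas[i]) * dot(deltas[j], deltas[j]))
            cosines.append(dot(deltas[i], deltas[j]) / norm if norm > 0 else 0.0)
    cos_theta = sum(cosines) / len(cosines)

    # Interpolation ratio: t = k*cos / (1 + (k-1)*cos).
    # (Assumes the denominator is nonzero; a real implementation should guard this.)
    t = k * cos_theta / (1 + (k - 1) * cos_theta)

    # Move from the base toward the plain average of the fine-tuned weights.
    avg = [sum(ws) / k for ws in zip(*finetuned)]
    return [t * a + (1 - t) * b for a, b in zip(avg, base)]
```

With three fine-tuned models, as in this merge, the ratio works out to t = 3cosθ / (1 + 2cosθ). If the task vectors all point the same way, t is 1 and the result is the plain average; if they are mutually orthogonal, t is 0 and the result stays at the base. For example, `model_stock_merge([0.0, 0.0], [[1.0, 2.0]] * 3)` gives back the average, `[1.0, 2.0]`.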

Models Merged

The following models were included in the merge:

- inflatebot/helide-beta-r4
- inflatebot/helide-beta-r0
- inflatebot/helide-beta-r3

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: inflatebot/helide-beta-r4
  - model: inflatebot/helide-beta-r0
  - model: inflatebot/helide-beta-r3

merge_method: model_stock
base_model: inflatebot/helide-beta-r1
dtype: bfloat16