Edit model card

License: https://llama.meta.com/llama3/license

240608: Now deprecated in favor of https://huggingface.co/neph1/llama-3-instruct-bellman-8b-swedish

240508: Where is the real version?

There's been a number of issues with llama 3 finetuning, and most recently an issue with the tokenizer for ggufs. I'm holding back any release until the situation has stabilizied.

240421: Bellman's back! This is a preliminary test run, run on mostly the same settings and dataset as bellman-mistral-instruct (But not the dpo pass). Context length is 3072. I've done some basic testing, and it's not a total mess. Whether it's an improvement to llama-3-instruct, I'm not sure, because that's REALLY good.

I'll try to make a pass on full context length soon. And hopefully improve the results more.

Make sure to use the correct chat template (llama-3) for best results. I tested with koboldcpp.

Let me know if you'd like any specific version. To save time I only put the one up for this 'preview'.

Trained with Unsloth (https://github.com/unslothai/unsloth).

image/png

User: Vem var Edward Teach?

AI: Edward Teach, även känd som Svartskägg, var en engelsk sjörövare och piratkapten under 1700-talet. Han anses vara en av de mest berömda sjörövarna i historien.

User: Träffade han Gustav Vasa?

AI: Nej, Edward Teach, även känd som Svartskägg, levde mellan ca 1680-1720, medan Gustav Vasa dog 1560. Deras tidslinjer korsades inte.

User: Berätta om selma lagerlöf

AI: Selma Lagerlöf var en svensk författare som vann Nobelpriset i litteratur 1909. Hon är mest känd för sina verk 'Nils Holgerssons underbara resa genom Sverige' och 'Jerusalem'. Hennes skrivande inspirerades ofta av den svenska naturen och folket.

Downloads last month
41
GGUF
Model size
8.03B params
Architecture
llama

6-bit

Inference API
Unable to determine this model's library. Check the docs .