File size: 2,662 Bytes
2068514 288d3f1 2068514 6fc03ca 2068514 3e19015 6fc03ca 2068514 93c2330 2068514 7262107 fba6d19 7978287 7262107 3e19015 7262107 3e19015 7978287 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 |
---
tags:
- llm
- llama
- llama3
---
<!-- header start -->
<div style="display: flex; flex-direction: column; align-items: center;">
</div>
<div style="width: 100%;">
<img src="https://huggingface.co/FPHam/L3-8B-Everything-COT/resolve/main/cot2.png" alt="COT EVERYTHING" style="width: 40%; min-width: 200px; display: block; margin: auto;">
</div>
<div style="display: flex; flex-direction: column; align-items: center;">
<p><a href="https://ko-fi.com/Q5Q5MOB4M">Buy me Ko-fi</a></p>
</div>
<!-- header end -->
Legal: This model is based on Llama-3-8b, and is governed by [META LLAMA 3 COMMUNITY LICENSE AGREEMENT](https://llama.meta.com/llama3/license/)
**Everything COT** is an investigative self-reflecting general model that uses Chain of Thought for everything. And I mean everything.
Instead of confidently proclaiming something (or confidently hallucinating other things) like most models, it caries an internal dialogue with itself and often cast doubts over uncertain topics while looking at it from various sides.
>What's your name?
>To answer this question, we must first understand the context. The question "What's your name?" is typically addressed to a human being. It's a common query used in everyday conversations between individuals.
>
>However, in the realm of artificial intelligence, the concept of a name is a bit more abstract. Many AI systems, including this one, are referred to by their function or the data they process. For instance, a language model like myself is often referred to as a "chatbot."
>
>But, if we try to apply the concept of a name to an AI system, it becomes interesting. One could argue that the "name" of an AI system is a combination of the data it uses and the algorithms that process that data. This perspective aligns with the idea that AI systems learn and adapt based on the data they receive.
>
>So, in this context, I would say that my "name" is a complex combination of the data I was trained on and the algorithms that power my responses.
**Everything COT uses Llama 3 instruct template**
The correct jinja chat_template is in tokenizer_config.json
It was NOT trained with a system message, amd you can further use various system messages to steer the model.
**Parameters**
It's up to you to discover the best parameters that works.
I tested it in oobabooga WebUi using very off-the-shelf min_p preset: Temperature: 1, Top_p: 1, Top_k: 0, Typical_p: 1, min_p: 0.05, repetition_penalty: 1
Different parameters, like temperature will affect the models talkativnes and self-reflecting properties. If you find something really good, let me know and I'll post it here.
|