Update README.md
Browse files
README.md
CHANGED
@@ -5,7 +5,7 @@ license: apache-2.0
|
|
5 |
**IMPORTANT**: Make sure you have the latest version of [llama.cpp](https://github.com/ggerganov/llama.cpp) to use these:
|
6 |
|
7 |
- The [PR that allows multiple control vectors to be loaded](https://github.com/ggerganov/llama.cpp/commit/97877eb10bd8e7f8023420b5b5300bcbdadd62dc) (without zero-padding) just got merged today (27/06/24).
|
8 |
-
- Older versions of `llama.cpp` will just ***silently*** load the first control vector and none of the others
|
9 |
|
10 |
To use these control vectors effectively you will need to use the "`--control-vector-scaled`" option like this:
|
11 |
|
@@ -25,6 +25,6 @@ llama-cli --model <model name>.gguf --control-vector-scaled <model name>-dark.gg
|
|
25 |
|
26 |
- Use ***negative scale factors*** for traits you ***DON'T*** want and ***positive scale factors*** for traits you ***DO*** want.
|
27 |
- The "positive" and "law" traits likely need to use a smaller magnitude of scale factor like `-0.5` or `-0.75` (unless you like ***really*** Grim stories! :D).
|
28 |
-
- The "dark" and "chaos" traits can
|
29 |
- You can use some or all of the 4 control vector files, or alternatively set the scale factor to `0.0` for any traits you don't want to use.
|
30 |
- You can use the same "`--control-vector-scaled`" command line arguments for "`llama-server`" as in the above "`llama-cli`" examples.
|
|
|
5 |
**IMPORTANT**: Make sure you have the latest version of [llama.cpp](https://github.com/ggerganov/llama.cpp) to use these:
|
6 |
|
7 |
- The [PR that allows multiple control vectors to be loaded](https://github.com/ggerganov/llama.cpp/commit/97877eb10bd8e7f8023420b5b5300bcbdadd62dc) (without zero-padding) just got merged today (27/06/24).
|
8 |
+
- Older versions of `llama.cpp` will just ***silently*** load the first control vector (and none of the others) if the layer index of the final direction in each file does not match...
|
9 |
|
10 |
To use these control vectors effectively you will need to use the "`--control-vector-scaled`" option like this:
|
11 |
|
|
|
25 |
|
26 |
- Use ***negative scale factors*** for traits you ***DON'T*** want and ***positive scale factors*** for traits you ***DO*** want.
|
27 |
- The "positive" and "law" traits likely need to use a smaller magnitude of scale factor like `-0.5` or `-0.75` (unless you like ***really*** Grim stories! :D).
|
28 |
+
- The "dark" and "chaos" traits can often use larger magnitude scale factors like `1.5` or `2.0` before the model starts to repeat gibberish.
|
29 |
- You can use some or all of the 4 control vector files, or alternatively set the scale factor to `0.0` for any traits you don't want to use.
|
30 |
- You can use the same "`--control-vector-scaled`" command line arguments for "`llama-server`" as in the above "`llama-cli`" examples.
|