Update README.md
README.md
CHANGED
@@ -42,7 +42,7 @@ datasets:
 
 An experimental fine-tune of qwen-14b using [bagel](https://github.com/jondurbin/bagel)
 
-The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the mistral-7b version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
+The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the [mistral-7b](https://huggingface.co/jondurbin/bagel-dpo-7b-v0.1) version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
 
 I modified the qwen tokenizer to use `<s>` instead of `<|im_start|>` and `</s>` instead of `<|endoftext|>`, and it may have caused some issues but I'm not entirely sure.
 