jondurbin commited on
Commit
36f16fe
1 Parent(s): 73f1414

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -42,7 +42,7 @@ datasets:
42
 
43
  An experimental fine-tune of qwen-14b using [bagel](https://github.com/jondurbin/bagel)
44
 
45
- The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the mistral-7b version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
46
 
47
  I modified the qwen tokenizer to use `<s>` instead of `<|im_start|>` and `</s>` instead of `<|endoftext|>`, and it may have caused some issues but I'm not entirely sure.
48
 
 
42
 
43
  An experimental fine-tune of qwen-14b using [bagel](https://github.com/jondurbin/bagel)
44
 
45
+ The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the [mistral-7b](https://huggingface.co/jondurbin/bagel-dpo-7b-v0.1) version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
46
 
47
  I modified the qwen tokenizer to use `<s>` instead of `<|im_start|>` and `</s>` instead of `<|endoftext|>`, and it may have caused some issues but I'm not entirely sure.
48