Commit 488b5d3
1 Parent(s): a96586c
Update README.md
README.md
CHANGED
@@ -24,7 +24,7 @@ tags:
 </p>
 
 
-This model is the launching partner of the capybara-dpo datasets build with distilabel. It's a preference tuned [OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B).
+This model is the launching partner of the [capybara-dpo dataset](https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-9k-binarized) build with ⚗️ distilabel. It's a preference tuned [OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B).
 
 CapybaraHermes has been preference tuned with LoRA and TRL for 3 epochs using argilla's [dpo mix 7k](https://huggingface.co/datasets/argilla/dpo-mix-7k).
 