charlesdedampierre commited on
Commit
ee0dcc3
1 Parent(s): c1c3fdd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -12,10 +12,13 @@ The model was trained on a refined DPO dataset. The objective was to train the m
12
  We then conducted topic modeling on both datasets, keeping only the topics that existed in the accepted dataset but not in the rejected one.
13
  Our hypothesis is that these topics encapsulate the main differences between the two answering styles.
14
 
15
- This method allows for quicker convergence with significantly less data (around 1/6 of the initial dataset).
16
 
17
  Special thanks to [mlabonne](https://huggingface.co/mlabonne) for creating the [colab notebook](https://colab.research.google.com/drive/15iFBr1xWgztXvhrj5I9fBv20c7CFOPBE?usp=sharing#scrollTo=YpdkZsMNylvp) that facilitated the DPO Strategy.
18
 
 
 
 
19
 
20
  ## Topic Analysis
21
 
@@ -121,4 +124,3 @@ print(sequences[0]['generated_text'])
121
 
122
 
123
  You can find the results of the running on Weights & Biases: https://wandb.ai/bunka/huggingface/runs/xq59p47g?workspace=user-charlesdedampierre
124
-
 
12
  We then conducted topic modeling on both datasets, keeping only the topics that existed in the accepted dataset but not in the rejected one.
13
  Our hypothesis is that these topics encapsulate the main differences between the two answering styles.
14
 
15
+ This method allows for quicker convergence with significantly less data (around 1/6 of the initial dataset). The Dataset can be found at [bunkalab/topic_based_chatml_dpo_pairs](https://huggingface.co/datasets/bunkalab/topic_based_chatml_dpo_pairs)
16
 
17
  Special thanks to [mlabonne](https://huggingface.co/mlabonne) for creating the [colab notebook](https://colab.research.google.com/drive/15iFBr1xWgztXvhrj5I9fBv20c7CFOPBE?usp=sharing#scrollTo=YpdkZsMNylvp) that facilitated the DPO Strategy.
18
 
19
+ Results of the model can be found here: We do as well as similar models with way less data and computing power :)
20
+
21
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63c13d74f02ef5b95e0e448e/J8b23NeMhLeqXLQMZwxdh.png)
22
 
23
  ## Topic Analysis
24
 
 
124
 
125
 
126
  You can find the results of the running on Weights & Biases: https://wandb.ai/bunka/huggingface/runs/xq59p47g?workspace=user-charlesdedampierre