allenai/tulu-2-dpo-7b

hamishivi committed on Nov 20, 2023

Commit f1a3088 • 1 Parent(s): f05fb21

Update README.md

Files changed (1)

README.md +9 -6

@@ -20,6 +20,8 @@ Tulu is a series of language models that are trained to act as helpful assistant
 Tulu V2 DPO 7B is a fine-tuned version of Llama 2 that was trained on on a mix of publicly available, synthetic and human datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).
 This model is a strong alternative to Llama 2 7b Chat.
+For more details, read the paper: [Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
+](https://arxiv.org/abs/2311.10702).
 ## Model description
@@ -134,12 +136,13 @@ The following hyperparameters were used during DPO training:
 If you find Tulu 2 is useful in your work, please cite it with:
 ```
-@misc{ivison2023changing,
-   title={Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2},
-   author={Hamish Ivison and Yizhong Wang and Valentina Pyatkin and Nathan Lambert and Matthew Peters and Pradeep Dasigi and Joel Jang and David Wadden and Noah A. Smith and Iz Beltagy and Hannaneh Hajishirzi},
-   year={2023},
-   archivePrefix={arXiv},
-   primaryClass={cs.CL}
+@misc{ivison2023camels,
+      title={Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2},
+      author={Hamish Ivison and Yizhong Wang and Valentina Pyatkin and Nathan Lambert and Matthew Peters and Pradeep Dasigi and Joel Jang and David Wadden and Noah A. Smith and Iz Beltagy and Hannaneh Hajishirzi},
+      year={2023},
+      eprint={2311.10702},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
 }
 ```