Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,7 @@ language:
|
|
7 |
---
|
8 |
|
9 |
# Suri-I-ORPO
|
10 |
-
Suri-I-ORPO is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using instructional odds ratio preference optimization (I-ORPO). Please check [our paper](
|
11 |
|
12 |
## π Model Details
|
13 |
|
@@ -99,7 +99,15 @@ print(tokenizer.decode(output[0]))
|
|
99 |
## π Citation
|
100 |
|
101 |
```
|
102 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
103 |
```
|
104 |
|
105 |
### βοΈ Framework versions
|
|
|
7 |
---
|
8 |
|
9 |
# Suri-I-ORPO
|
10 |
+
Suri-I-ORPO is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.2 using instructional odds ratio preference optimization (I-ORPO). Please check [our paper](https://arxiv.org/abs/2406.19371) for more details on the method.
|
11 |
|
12 |
## π Model Details
|
13 |
|
|
|
99 |
## π Citation
|
100 |
|
101 |
```
|
102 |
+
@misc{pham2024surimulticonstraintinstructionfollowing,
|
103 |
+
title={Suri: Multi-constraint Instruction Following for Long-form Text Generation},
|
104 |
+
author={Chau Minh Pham and Simeng Sun and Mohit Iyyer},
|
105 |
+
year={2024},
|
106 |
+
eprint={2406.19371},
|
107 |
+
archivePrefix={arXiv},
|
108 |
+
primaryClass={cs.CL},
|
109 |
+
url={https://arxiv.org/abs/2406.19371},
|
110 |
+
}
|
111 |
```
|
112 |
|
113 |
### βοΈ Framework versions
|