Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ license: llama3
|
|
12 |
|
13 |
# Model Card for Llama-3-8B-Instruct-Iterative-SamPO
|
14 |
|
15 |
-
This repository provides a fine-tuned version of Llama-3-8B-Instruct, using our proposed [SamPO](https://github.com/LuJunru/SamPO) algorithm. We obey all licenses mentioned in llama3's work.
|
16 |
|
17 |
## Performance
|
18 |
|
|
|
12 |
|
13 |
# Model Card for Llama-3-8B-Instruct-Iterative-SamPO
|
14 |
|
15 |
+
This repository provides a fine-tuned version of Llama-3-8B-Instruct, using our proposed [SamPO](https://github.com/LuJunru/SamPO) algorithm: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence. We obey all licenses mentioned in llama3's work.
|
16 |
|
17 |
## Performance
|
18 |
|