Junrulu committed on
Commit
35869f7
1 Parent(s): 5d3299a

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ license: llama3
 
  # Model Card for Llama-3-8B-Instruct-Iterative-SamPO
 
- This repository provides a fine-tuned version of Llama-3-8B-Instruct, using our proposed [SamPO](https://github.com/LuJunru/SamPO) algorithm. We obey all licenses mentioned in llama3's work.
+ This repository provides a fine-tuned version of Llama-3-8B-Instruct, using our proposed [SamPO](https://github.com/LuJunru/SamPO) algorithm: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence. We obey all licenses mentioned in llama3's work.
 
  ## Performance
 