Junrulu
/

Llama-3-8B-Instruct-Iterative-SamPO

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Junrulu commited on Jun 11

Commit

35869f7

•

1 Parent(s): 5d3299a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ license: llama3
 # Model Card for Llama-3-8B-Instruct-Iterative-SamPO
-This repository provides a fine-tuned version of Llama-3-8B-Instruct, using our proposed [SamPO](https://github.com/LuJunru/SamPO) algorithm. We obey all licenses mentioned in llama3's work.
 ## Performance

 # Model Card for Llama-3-8B-Instruct-Iterative-SamPO
+This repository provides a fine-tuned version of Llama-3-8B-Instruct, using our proposed [SamPO](https://github.com/LuJunru/SamPO) algorithm: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence. We obey all licenses mentioned in llama3's work.
 ## Performance