Update README.md
README.md
CHANGED
@@ -55,7 +55,7 @@ We have included a [chat template](https://huggingface.co/docs/transformers/main
 ## Intended uses & limitations

 The model was initially fine-tuned on a filtered and preprocessed of the [Tulu V2 mix dataset](https://huggingface.co/datasets/allenai/tulu-v2-sft-mixture), which contains a diverse range of human created instructions and synthetic dialogues generated primarily by other LLMs.
-We then further aligned the model with a [Jax
+We then further aligned the model with a [Jax PPO trainer](https://github.com/hamishivi/EasyLM/blob/main/EasyLM/models/llama/llama_train_ppo.py) built on [EasyLM](https://github.com/young-geng/EasyLM) on the dataset mentioned above.

 ## Bias, Risks, and Limitations
