JacopoAbate
commited on
Commit
•
7616f45
1
Parent(s):
011719d
Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,7 @@ metrics:
|
|
17 |
# Model Information
|
18 |
|
19 |
|
20 |
-
|
21 |
|
22 |
- It's trained on [ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned).
|
23 |
|
@@ -43,8 +43,8 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
|
|
43 |
|
44 |
device = "cpu" # if you want to use the gpu make sure to have cuda toolkit installed and change this to "cuda"
|
45 |
|
46 |
-
model = AutoModelForCausalLM.from_pretrained("MoxoffSpA/
|
47 |
-
tokenizer = AutoTokenizer.from_pretrained("MoxoffSpA/
|
48 |
|
49 |
question = """Quanto è alta la torre di Pisa?"""
|
50 |
context = """
|
@@ -78,7 +78,7 @@ print(trimmed_output)
|
|
78 |
|
79 |
## Bias, Risks and Limitations
|
80 |
|
81 |
-
|
82 |
responses like ChatGPT, so the model can produce problematic outputs (especially when prompted to do so). It is also unknown what the size and composition
|
83 |
of the corpus was used to train the base model, however it is likely to have included a mix of Web data and technical sources
|
84 |
like books and code.
|
|
|
17 |
# Model Information
|
18 |
|
19 |
|
20 |
+
Moxoff-Phi3Mini-PPO is an updated version of [Phi-3-mini-128k-instruct](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct), aligned with PPO.
|
21 |
|
22 |
- It's trained on [ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned).
|
23 |
|
|
|
43 |
|
44 |
device = "cpu" # if you want to use the gpu make sure to have cuda toolkit installed and change this to "cuda"
|
45 |
|
46 |
+
model = AutoModelForCausalLM.from_pretrained("MoxoffSpA/Moxoff-Phi3Mini-PPO")
|
47 |
+
tokenizer = AutoTokenizer.from_pretrained("MoxoffSpA/Moxoff-Phi3Mini-PPO")
|
48 |
|
49 |
question = """Quanto è alta la torre di Pisa?"""
|
50 |
context = """
|
|
|
78 |
|
79 |
## Bias, Risks and Limitations
|
80 |
|
81 |
+
Moxoff-Phi3Mini-PPO has not been aligned to human preferences for safety within the RLHF phase or deployed with in-the-loop filtering of
|
82 |
responses like ChatGPT, so the model can produce problematic outputs (especially when prompted to do so). It is also unknown what the size and composition
|
83 |
of the corpus was used to train the base model, however it is likely to have included a mix of Web data and technical sources
|
84 |
like books and code.
|