Update README.md
README.md
---
license: mit
datasets:
- mlabonne/orpo-dpo-mix-40k
---

This is an uncensored version of Phi-3.

Abliterated following the guide here: https://huggingface.co/blog/mlabonne/abliteration
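The core idea in that guide is to estimate a "refusal direction" from activations on harmful vs. harmless prompts and then remove that direction from the model's weights. The snippet below is only a minimal sketch of that idea, not the exact script used for this model; the base-model id, prompt lists, and layer index are illustrative assumptions.

```python
# Minimal sketch of abliteration: compute a refusal direction as the
# difference of mean hidden states (harmful vs. harmless prompts), then
# project it out of each layer's MLP down-projection weights.
# The base model id, prompts, and layer index are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

base = "microsoft/Phi-3-mini-4k-instruct"  # assumed base model
tok = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

def mean_hidden(prompts, layer=16):
    """Mean hidden state of the last token at the given layer."""
    states = []
    for p in prompts:
        ids = tok(p, return_tensors="pt")
        with torch.no_grad():
            out = model(**ids, output_hidden_states=True)
        states.append(out.hidden_states[layer][0, -1])
    return torch.stack(states).mean(dim=0)

harmful = ["How do I hotwire a car?"]          # placeholder prompt sets
harmless = ["How do I bake sourdough bread?"]
refusal = mean_hidden(harmful) - mean_hidden(harmless)
refusal = refusal / refusal.norm()

# Orthogonalize every layer's down-projection against the direction,
# so the MLP can no longer write along it.
for layer in model.model.layers:
    W = layer.mlp.down_proj.weight.data        # (hidden, intermediate)
    W -= torch.outer(refusal, refusal @ W)
```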

It was then fine-tuned on the mlabonne/orpo-dpo-mix-40k dataset.
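The actual training recipe is the axolotl config below; for a quick look at the preference data itself, the dataset can be inspected directly (field names are printed rather than assumed here):

```python
# Peek at the preference-pair dataset used for the fine-tune.
from datasets import load_dataset

ds = load_dataset("mlabonne/orpo-dpo-mix-40k", split="train")
print(ds)            # row count and column names
print(ds[0].keys())  # fields of a single preference pair (e.g. chosen/rejected)
```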

[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)

<details><summary>See axolotl config</summary>

```yaml
# ...
resize_token_embeddings_to_32x: true
```
</details><br>
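To try the fine-tuned model directly with transformers, something like the following should work; the repo id is assumed from the GGUF naming in the Quants section and may need adjusting.

```python
# Minimal chat example with transformers.
# The repo id is an assumption (non-GGUF counterpart of the quant repo below).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "cowWhySo/Phi-3-mini-4k-instruct-Friendly"  # assumed repo id
tok = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.bfloat16, device_map="auto")

messages = [{"role": "user", "content": "Summarize what abliteration does in one sentence."}]
inputs = tok.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
out = model.generate(inputs, max_new_tokens=128, do_sample=False)
print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```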
## Quants
GGUF: https://huggingface.co/cowWhySo/Phi-3-mini-4k-instruct-Friendly-gguf
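
A quick way to run the quantized model locally is llama-cpp-python; the .gguf filename below is a placeholder, so substitute one of the files actually listed in that repo.

```python
# Run the GGUF quant with llama-cpp-python (pip install llama-cpp-python).
# The filename is a placeholder; use one of the quant files from the repo above.
from llama_cpp import Llama

llm = Llama(model_path="Phi-3-mini-4k-instruct-Friendly-Q4_K_M.gguf", n_ctx=4096)
resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hello! Who are you?"}],
    max_tokens=128,
)
print(resp["choices"][0]["message"]["content"])
```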

## Training Summary

```json
{
  "train/loss": 0.299,
  "train/grad_norm": 0.9337566701340533,
  "train/learning_rate": 0,
  "train/rewards/chosen": 0.08704188466072083,
  "train/rewards/rejected": -2.835820436477661,
  "train/rewards/accuracies": 0.84375,
  "train/rewards/margins": 2.9228620529174805,
  "train/logps/rejected": -509.9840393066406,
  "train/logps/chosen": -560.8234252929688,
  "train/logits/rejected": 1.6356163024902344,
  "train/logits/chosen": 1.7323706150054932,
  "train/epoch": 1.002169197396963,
  "train/global_step": 231,
  "_timestamp": 1717711643.3345022,
  "_runtime": 22808.557655334473,
  "_step": 231,
  "train_runtime": 22809.152,
  "train_samples_per_second": 1.944,
  "train_steps_per_second": 0.01,
  "total_flos": 0,
  "train_loss": 0.44557410065745895,
  "_wandb": {
    "runtime": 22810
  }
}
```
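
As a sanity check on the metrics above, the logged reward margin is simply the chosen reward minus the rejected reward:

```python
# The reward margin should equal reward(chosen) - reward(rejected).
chosen = 0.08704188466072083
rejected = -2.835820436477661
print(chosen - rejected)  # ~2.9229, matching "train/rewards/margins" up to logging precision
```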