aashish1904 commited on
Commit
78241ec
1 Parent(s): 8c7b278

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ library_name: transformers
5
+ license: llama3.1
6
+ datasets:
7
+ - jondurbin/gutenberg-dpo-v0.1
8
+ - nbeerbower/gutenberg2-dpo
9
+ - jondurbin/truthy-dpo-v0.1
10
+ - kyujinpy/orca_math_dpo
11
+ - antiven0m/physical-reasoning-dpo
12
+ base_model:
13
+ - mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated
14
+
15
+ ---
16
+
17
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
18
+
19
+
20
+ # QuantFactory/Llama3.1-Allades-8B-GGUF
21
+ This is quantized version of [nbeerbower/Llama3.1-Allades-8B](https://huggingface.co/nbeerbower/Llama3.1-Allades-8B) created using llama.cpp
22
+
23
+ # Original Model Card
24
+
25
+
26
+ # Llama3.1-Allades-8B
27
+
28
+ Allades finetunes abliterated Llama 3.1 with 5 datasets to improve creative writing, reasoning, and roleplay.
29
+
30
+ ## Datasets
31
+
32
+ - [jondurbin/gutenberg-dpo-v0.1](https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1)
33
+ - [nbeerbower/gutenberg2-dpo](https://huggingface.co/datasets/nbeerbower/gutenberg2-dpo)
34
+ - [jondurbin/truthy-dpo-v0.1](https://huggingface.co/datasets/jondurbin/truthy-dpo-v0.1)
35
+ - [kyujinpy/orca_math_dpo](https://huggingface.co/datasets/kyujinpy/orca_math_dpo)
36
+ - [antiven0m/physical-reasoning-dpo](https://huggingface.co/datasets/antiven0m/physical-reasoning-dpo)
37
+
38
+ ## Training
39
+
40
+ [ORPO tuned](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html) for 1 epoch with 2x RTX 3090 (sponsored by [Schneewolf Labs](https://schneewolflabs.com)).
41
+
42
+ Data was prepared with [Llama 3.1 Instruct](https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_1/).