ruggsea/Llama3-stanford-encyclopedia-philosophy-QA

@@ -1,16 +1,16 @@
 ---
 license: other
 tags:
 - trl
 - sft
 - generated_from_trainer
 base_model: meta-llama/Meta-Llama-3-8B-Instruct
 model-index:
 - name: Llama3-stanford-encyclopedia-philosophy-QA
   results: []
-language:
-- en
-pipeline_tag: text-generation
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,23 +18,21 @@ should probably proofread and complete it, then remove this comment. -->
 # Llama3-stanford-encyclopedia-philosophy-QA
-This model is a Qlora finetune of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the [Stanford Encyclopedia of Philosophy-instruct](https://huggingface.co/datasets/ruggsea/stanford-encyclopedia-of-philosophy_instruct) dataset. It is meant for answering philosophical questions in a more formal tone.
 ## Model description
-The model was trained with the following system prompt:
-```
-"You are an expert and informative yet accessible Philosophy university professor. Students will pose you philosophical questions, answer them in a correct and rigorous but not to obscure way."
-```
-Furthermore, the chat dataset was formatted using the Llama3-instruct chat format:
-```
-<|begin_of_text|><|start_header_id|>system<|end_header_id|>
-{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
-{{ user_message }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
-```
 ### Training hyperparameters
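
The removed README lines above describe the system prompt and the Llama3-instruct chat layout used for training. A minimal sketch of how a single-turn prompt could be assembled from them is shown here. The function name `build_prompt` is hypothetical, the system prompt is copied verbatim from the removed lines (including its original wording), and the whitespace follows the template exactly as it appears in this diff. In practice, the tokenizer's own chat template is likely the safer source of truth for exact spacing.

```python
# System prompt copied verbatim from the removed README lines.
SYSTEM_PROMPT = (
    "You are an expert and informative yet accessible Philosophy university professor. "
    "Students will pose you philosophical questions, answer them in a correct and rigorous "
    "but not to obscure way."
)


def build_prompt(user_message: str, system_prompt: str = SYSTEM_PROMPT) -> str:
    """Assemble a single-turn prompt in the chat layout shown in the diff.

    `build_prompt` is a hypothetical helper, not part of the repository.
    """
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n"
        f"{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n"
        f"{user_message}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n"
    )


if __name__ == "__main__":
    print(build_prompt("What is the difference between knowledge and true belief?"))
```

The prompt ends with the assistant header so that generation continues with the model's answer.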

 ---
 license: other
+library_name: peft
 tags:
 - trl
 - sft
 - generated_from_trainer
 base_model: meta-llama/Meta-Llama-3-8B-Instruct
+datasets:
+- generator
 model-index:
 - name: Llama3-stanford-encyclopedia-philosophy-QA
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # Llama3-stanford-encyclopedia-philosophy-QA
+This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the generator dataset.
 ## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
 ### Training hyperparameters

adapter_config.json

@@ -21,12 +21,12 @@
   "revision": null,
   "target_modules": [
     "q_proj",
     "k_proj",
-    "gate_proj",
-    "o_proj",
-    "down_proj",
     "v_proj",
-    "up_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "revision": null,
   "target_modules": [
     "q_proj",
+    "up_proj",
     "k_proj",
     "v_proj",
+    "o_proj",
+    "gate_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
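
The `target_modules` hunk above is easier to read as a set comparison than line by line. The sketch below copies both lists from the two sides of the diff and checks whether any module was actually added or removed. The variable names are illustrative only.

```python
# Lists copied from the old and new sides of the adapter_config.json hunk.
old_target_modules = [
    "q_proj", "k_proj", "gate_proj", "o_proj", "down_proj", "v_proj", "up_proj",
]
new_target_modules = [
    "q_proj", "up_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "down_proj",
]

# Modules present on only one side of the diff.
added = sorted(set(new_target_modules) - set(old_target_modules))
removed = sorted(set(old_target_modules) - set(new_target_modules))

# True when the two lists contain the same modules, regardless of order.
same_modules = not added and not removed

if __name__ == "__main__":
    print("added:", added)
    print("removed:", removed)
    print("same modules, different order:",
          same_modules and old_target_modules != new_target_modules)
```

Both lists contain the same seven projection modules, so this hunk changes only their order. That would be consistent with the list being serialized from an unordered collection, although this is an inference and not something the diff itself states.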

adapter_model.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cb14ba89d00080f75b295d1fc018ae59580aa4b38afa36bf31f3f62c0f531072
 size 3443602656

 version https://git-lfs.github.com/spec/v1
+oid sha256:037bed700731486201a06fb1cc83a5e1fdef8f75558dbec764b9379dde969e52
 size 3443602656
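
The two pointer files above differ only in their `oid`, while `size` is unchanged. As a rough sketch, a downloaded file could be checked against such a pointer by comparing its byte size and SHA-256 digest. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is copied from the new side of the diff.

```python
import hashlib
from pathlib import Path

# Pointer text copied from the new side of the adapter_model.safetensors diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:037bed700731486201a06fb1cc83a5e1fdef8f75558dbec764b9379dde969e52
size 3443602656
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest named by the pointer."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so multi-gigabyte files are not loaded into memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

Calling `matches_pointer("adapter_model.safetensors", parse_lfs_pointer(NEW_POINTER))` on a local copy would then indicate whether that copy corresponds to the new revision.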

training_args.bin

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:015363781a553f31b602011b5b8d23b84fb65a873ae69353f276cc9a7460a7dd
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:8d99081fae6a7aa87ab5fdecd808cc776b96672438f5683ea18cbaa64f7cf2aa
 size 5048