ruggsea commited on
Commit
20a0881
1 Parent(s): cc1b6ba

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -24
README.md CHANGED
@@ -1,16 +1,16 @@
1
  ---
2
  license: other
3
- library_name: peft
4
  tags:
5
  - trl
6
  - sft
7
  - generated_from_trainer
8
  base_model: meta-llama/Meta-Llama-3-8B-Instruct
9
- datasets:
10
- - generator
11
  model-index:
12
  - name: Llama3-stanford-encyclopedia-philosophy-QA
13
  results: []
 
 
 
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,23 +18,23 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # Llama3-stanford-encyclopedia-philosophy-QA
20
 
21
- This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the generator dataset.
22
- It achieves the following results on the evaluation set:
23
- - Loss: 1.9202
24
 
25
  ## Model description
26
 
27
- More information needed
 
 
 
28
 
29
- ## Intended uses & limitations
 
 
30
 
31
- More information needed
32
 
33
- ## Training and evaluation data
34
-
35
- More information needed
36
-
37
- ## Training procedure
38
 
39
  ### Training hyperparameters
40
 
@@ -52,16 +52,6 @@ The following hyperparameters were used during training:
52
 
53
  ### Training results
54
 
55
- | Training Loss | Epoch | Step | Validation Loss |
56
- |:-------------:|:------:|:----:|:---------------:|
57
- | 1.9641 | 0.3529 | 15 | 1.9525 |
58
- | 1.9096 | 0.7059 | 30 | 1.9184 |
59
- | 1.8421 | 1.0588 | 45 | 1.9071 |
60
- | 1.7913 | 1.4118 | 60 | 1.8996 |
61
- | 1.7812 | 1.7647 | 75 | 1.8928 |
62
- | 1.6468 | 2.1176 | 90 | 1.9158 |
63
- | 1.5843 | 2.4706 | 105 | 1.9286 |
64
- | 1.5829 | 2.8235 | 120 | 1.9202 |
65
 
66
 
67
  ### Framework versions
 
1
  ---
2
  license: other
 
3
  tags:
4
  - trl
5
  - sft
6
  - generated_from_trainer
7
  base_model: meta-llama/Meta-Llama-3-8B-Instruct
 
 
8
  model-index:
9
  - name: Llama3-stanford-encyclopedia-philosophy-QA
10
  results: []
11
+ language:
12
+ - en
13
+ pipeline_tag: text-generation
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
18
 
19
  # Llama3-stanford-encyclopedia-philosophy-QA
20
 
21
+ This model is a Qlora finetune of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the [Stanford Encyclopedia of Philosophy-instruct](https://huggingface.co/datasets/ruggsea/stanford-encyclopedia-of-philosophy_instruct) dataset. It is meant for answering philosophical questions in a more formal tone.
 
 
22
 
23
  ## Model description
24
 
25
+ The model was trained with the following system prompt:
26
+ ```
27
+ "You are an expert and informative yet accessible Philosophy university professor. Students will pose you philosophical questions, answer them in a correct and rigorous but not to obscure way."
28
+ ```
29
 
30
+ Furthermore, the chat dataset was formatted using the Llama3-instruct chat format:
31
+ ```
32
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
33
 
34
+ {{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
35
 
36
+ {{ user_message }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
37
+ ```
 
 
 
38
 
39
  ### Training hyperparameters
40
 
 
52
 
53
  ### Training results
54
 
 
 
 
 
 
 
 
 
 
 
55
 
56
 
57
  ### Framework versions