dhmeltzer commited on
Commit
7b8589f
1 Parent(s): d23bfd5

Model save

Browse files
Files changed (1) hide show
  1. README.md +25 -0
README.md CHANGED
@@ -5,6 +5,7 @@ tags:
5
  model-index:
6
  - name: Llama-2-7b-hf-wiki-no-group-by-length_r_64_alpha_16
7
  results: []
 
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,6 +31,28 @@ More information needed
30
 
31
  ## Training procedure
32
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  ### Training hyperparameters
34
 
35
  The following hyperparameters were used during training:
@@ -61,6 +84,8 @@ The following hyperparameters were used during training:
61
 
62
  ### Framework versions
63
 
 
 
64
  - Transformers 4.31.0
65
  - Pytorch 2.0.0
66
  - Datasets 2.12.0
 
5
  model-index:
6
  - name: Llama-2-7b-hf-wiki-no-group-by-length_r_64_alpha_16
7
  results: []
8
+ library_name: peft
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
31
 
32
  ## Training procedure
33
 
34
+
35
+ The following `bitsandbytes` quantization config was used during training:
36
+ - load_in_8bit: False
37
+ - load_in_4bit: True
38
+ - llm_int8_threshold: 6.0
39
+ - llm_int8_skip_modules: None
40
+ - llm_int8_enable_fp32_cpu_offload: False
41
+ - llm_int8_has_fp16_weight: False
42
+ - bnb_4bit_quant_type: nf4
43
+ - bnb_4bit_use_double_quant: True
44
+ - bnb_4bit_compute_dtype: bfloat16
45
+
46
+ The following `bitsandbytes` quantization config was used during training:
47
+ - load_in_8bit: False
48
+ - load_in_4bit: True
49
+ - llm_int8_threshold: 6.0
50
+ - llm_int8_skip_modules: None
51
+ - llm_int8_enable_fp32_cpu_offload: False
52
+ - llm_int8_has_fp16_weight: False
53
+ - bnb_4bit_quant_type: nf4
54
+ - bnb_4bit_use_double_quant: True
55
+ - bnb_4bit_compute_dtype: bfloat16
56
  ### Training hyperparameters
57
 
58
  The following hyperparameters were used during training:
 
84
 
85
  ### Framework versions
86
 
87
+ - PEFT 0.4.0
88
+ - PEFT 0.4.0
89
  - Transformers 4.31.0
90
  - Pytorch 2.0.0
91
  - Datasets 2.12.0