AmberYifan commited on
Commit
32608b7
1 Parent(s): ba23e32

Model save

Browse files
README.md CHANGED
@@ -2,10 +2,7 @@
2
  license: mit
3
  base_model: microsoft/Phi-3-small-8k-instruct
4
  tags:
5
- - alignment-handbook
6
  - generated_from_trainer
7
- datasets:
8
- - AmberYifan/spin-v
9
  model-index:
10
  - name: phi3-spin-zephyr-data
11
  results: []
@@ -16,15 +13,15 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # phi3-spin-zephyr-data
18
 
19
- This model is a fine-tuned version of [microsoft/Phi-3-small-8k-instruct](https://huggingface.co/microsoft/Phi-3-small-8k-instruct) on the AmberYifan/spin-v dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1748
22
- - Rewards/real: -6.0641
23
- - Rewards/generated: -21.7894
24
- - Rewards/accuracies: 0.9443
25
- - Rewards/margins: 15.7252
26
- - Logps/generated: -509.3286
27
- - Logps/real: -313.0280
28
  - Logits/generated: -inf
29
  - Logits/real: -inf
30
 
@@ -62,7 +59,9 @@ The following hyperparameters were used during training:
62
 
63
  | Training Loss | Epoch | Step | Validation Loss | Rewards/real | Rewards/generated | Rewards/accuracies | Rewards/margins | Logps/generated | Logps/real | Logits/generated | Logits/real |
64
  |:-------------:|:-----:|:----:|:---------------:|:------------:|:-----------------:|:------------------:|:---------------:|:---------------:|:----------:|:----------------:|:-----------:|
65
- | 0.2945 | 0.64 | 500 | 0.1748 | -6.0641 | -21.7894 | 0.9443 | 15.7252 | -509.3286 | -313.0280 | -inf | -inf |
 
 
66
 
67
 
68
  ### Framework versions
 
2
  license: mit
3
  base_model: microsoft/Phi-3-small-8k-instruct
4
  tags:
 
5
  - generated_from_trainer
 
 
6
  model-index:
7
  - name: phi3-spin-zephyr-data
8
  results: []
 
13
 
14
  # phi3-spin-zephyr-data
15
 
16
+ This model is a fine-tuned version of [microsoft/Phi-3-small-8k-instruct](https://huggingface.co/microsoft/Phi-3-small-8k-instruct) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.1643
19
+ - Rewards/real: -4.3165
20
+ - Rewards/generated: -36.8197
21
+ - Rewards/accuracies: 0.9626
22
+ - Rewards/margins: 32.5032
23
+ - Logps/generated: -659.6320
24
+ - Logps/real: -295.5523
25
  - Logits/generated: -inf
26
  - Logits/real: -inf
27
 
 
59
 
60
  | Training Loss | Epoch | Step | Validation Loss | Rewards/real | Rewards/generated | Rewards/accuracies | Rewards/margins | Logps/generated | Logps/real | Logits/generated | Logits/real |
61
  |:-------------:|:-----:|:----:|:---------------:|:------------:|:-----------------:|:------------------:|:---------------:|:---------------:|:----------:|:----------------:|:-----------:|
62
+ | 0.3303 | 0.32 | 500 | 0.2003 | -4.8459 | -23.8426 | 0.9371 | 18.9967 | -529.8613 | -300.8461 | -inf | -inf |
63
+ | 0.0933 | 0.64 | 1000 | 0.1598 | -4.6590 | -34.8525 | 0.9610 | 30.1935 | -639.9600 | -298.9768 | -inf | -inf |
64
+ | 0.2065 | 0.96 | 1500 | 0.1643 | -4.3165 | -36.8197 | 0.9626 | 32.5032 | -659.6320 | -295.5523 | -inf | -inf |
65
 
66
 
67
  ### Framework versions
all_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "epoch": 1.0,
3
- "train_loss": 0.6284351689954493,
4
- "train_runtime": 6560.8804,
5
- "train_samples": 25000,
6
- "train_samples_per_second": 3.81,
7
- "train_steps_per_second": 0.119
8
  }
 
1
  {
2
  "epoch": 1.0,
3
+ "train_loss": 0.42768371397759275,
4
+ "train_runtime": 16836.7545,
5
+ "train_samples": 50000,
6
+ "train_samples_per_second": 2.97,
7
+ "train_steps_per_second": 0.093
8
  }
model-00001-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1efa10e64d5859b0237937a454e8d0e674af6517f71086f2e7e98bc3147fe71c
3
  size 4832943104
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4d2f34344b3f845374bd0eacb6371b7d83544a4d2c0d700d41c79a9b8acbd813
3
  size 4832943104
model-00002-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1ac4af61d8a50b24b2a0451b0d0d8d681f445298adb99946728417f8ea8a5f8c
3
  size 4799608224
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3b406d4b4abb0ab8def821820a778510ec2101601576296a3500370984f1c0d5
3
  size 4799608224
model-00003-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4cbc2426e9fb18df7c282f39a9fd4f5523abf48457b84c80ecdc7b84bf172b63
3
  size 4799608240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a3142a41839ca7e6a4a8f5041514b14710370172619404fd630af6b94ec3d13
3
  size 4799608240
model-00004-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:856cab213b70e9a43d567a410c8964012fdc3434091381c99187734430660556
3
  size 352437304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:292b1c6b6cc1e92b4e0a1e1aadc9d4c222af8ecdba677dda2afadc990b1fd700
3
  size 352437304
runs/Jul25_17-12-28_gilbreth-j001.rcac.purdue.edu/events.out.tfevents.1721942132.gilbreth-j001.rcac.purdue.edu.261418.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f7673a7b5503af583322470d0c20eb639908de807333f372de69385802bdafef
3
- size 104161
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:949793e977ea7ce2d23b7191162eb29e29eeca5e82e56df05a20a5d165328401
3
+ size 108301
train_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "epoch": 1.0,
3
- "train_loss": 0.6284351689954493,
4
- "train_runtime": 6560.8804,
5
- "train_samples": 25000,
6
- "train_samples_per_second": 3.81,
7
- "train_steps_per_second": 0.119
8
  }
 
1
  {
2
  "epoch": 1.0,
3
+ "train_loss": 0.42768371397759275,
4
+ "train_runtime": 16836.7545,
5
+ "train_samples": 50000,
6
+ "train_samples_per_second": 2.97,
7
+ "train_steps_per_second": 0.093
8
  }
trainer_state.json CHANGED
The diff for this file is too large to render. See raw diff