Model save
Browse files- README.md +18 -5
- events.out.tfevents.1717488111.isl-gpu33.2434801.0 +2 -2
- log.txt +133 -0
- model.safetensors +1 -1
README.md
CHANGED
@@ -15,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
15 |
|
16 |
# recreate_llama_68M_vanilla
|
17 |
|
18 |
-
This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on
|
19 |
It achieves the following results on the evaluation set:
|
20 |
-
- Loss:
|
21 |
-
- Accuracy: 0.
|
22 |
|
23 |
## Model description
|
24 |
|
@@ -38,8 +38,8 @@ More information needed
|
|
38 |
|
39 |
The following hyperparameters were used during training:
|
40 |
- learning_rate: 0.0001
|
41 |
-
- train_batch_size:
|
42 |
-
- eval_batch_size:
|
43 |
- seed: 42
|
44 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
45 |
- lr_scheduler_type: linear
|
@@ -48,6 +48,19 @@ The following hyperparameters were used during training:
|
|
48 |
|
49 |
### Training results
|
50 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
51 |
|
52 |
|
53 |
### Framework versions
|
|
|
15 |
|
16 |
# recreate_llama_68M_vanilla
|
17 |
|
18 |
+
This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
+
- Loss: 2.3603
|
21 |
+
- Accuracy: 0.5811
|
22 |
|
23 |
## Model description
|
24 |
|
|
|
38 |
|
39 |
The following hyperparameters were used during training:
|
40 |
- learning_rate: 0.0001
|
41 |
+
- train_batch_size: 24
|
42 |
+
- eval_batch_size: 48
|
43 |
- seed: 42
|
44 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
45 |
- lr_scheduler_type: linear
|
|
|
48 |
|
49 |
### Training results
|
50 |
|
51 |
+
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|
52 |
+
|:-------------:|:------:|:-----:|:---------------:|:--------:|
|
53 |
+
| 3.406 | 0.2644 | 1000 | 3.2345 | 0.5035 |
|
54 |
+
| 2.8119 | 0.5288 | 2000 | 2.8216 | 0.5365 |
|
55 |
+
| 2.6076 | 0.7932 | 3000 | 2.6553 | 0.5501 |
|
56 |
+
| 2.4729 | 1.0576 | 4000 | 2.5761 | 0.5581 |
|
57 |
+
| 2.4323 | 1.3221 | 5000 | 2.5363 | 0.5617 |
|
58 |
+
| 2.3824 | 1.5865 | 6000 | 2.4913 | 0.5660 |
|
59 |
+
| 2.3719 | 1.8509 | 7000 | 2.4664 | 0.5686 |
|
60 |
+
| 2.3021 | 2.1153 | 8000 | 2.4404 | 0.5716 |
|
61 |
+
| 2.2848 | 2.3797 | 9000 | 2.4080 | 0.5755 |
|
62 |
+
| 2.2653 | 2.6441 | 10000 | 2.3834 | 0.5785 |
|
63 |
+
| 2.2447 | 2.9085 | 11000 | 2.3603 | 0.5811 |
|
64 |
|
65 |
|
66 |
### Framework versions
|
events.out.tfevents.1717488111.isl-gpu33.2434801.0
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:be85f504bca71a575e7d21a6c09da598a15be6f6a806bb426595d8f930fff8ae
|
3 |
+
size 13500
|
log.txt
CHANGED
@@ -885,3 +885,136 @@ Loading cached processed dataset at /home/dshteyma/.cache/huggingface/datasets/j
|
|
885 |
[INFO|tokenization_utils_base.py:2512] 2024-06-04 06:34:46,021 >> Special tokens file saved in ./training_outputs_job_116987_1_04-06_01-01/special_tokens_map.json
|
886 |
/home/dshteyma/miniconda3/lib/python3.9/site-packages/torch/nn/parallel/_functions.py:68: UserWarning: Was asked to gather along dimension 0, but all input tensors were scalars; will instead unsqueeze and return a vector.
|
887 |
warnings.warn('Was asked to gather along dimension 0, but all '
|
|
|
888 |
97%|ββββββββββ| 11001/11346 [5:32:56<3:58:05, 41.41s/it]
|
889 |
97%|ββββββββββ| 11002/11346 [5:32:58<2:49:05, 29.49s/it]
|
890 |
97%|ββββββββββ| 11003/11346 [5:33:00<2:00:54, 21.15s/it]
|
891 |
97%|ββββββββββ| 11004/11346 [5:33:01<1:27:16, 15.31s/it]
|
892 |
97%|ββββββββββ| 11005/11346 [5:33:03<1:03:47, 11.22s/it]
|
893 |
97%|ββββββββββ| 11006/11346 [5:33:05<47:22, 8.36s/it]
|
894 |
97%|ββββββββββ| 11007/11346 [5:33:06<35:56, 6.36s/it]
|
895 |
97%|ββββββββββ| 11008/11346 [5:33:08<27:55, 4.96s/it]
|
896 |
97%|ββββββββββ| 11009/11346 [5:33:10<22:20, 3.98s/it]
|
897 |
97%|ββββββββββ| 11010/11346 [5:33:11<18:24, 3.29s/it]
|
898 |
97%|ββββββββββ| 11011/11346 [5:33:13<15:40, 2.81s/it]
|
899 |
97%|ββββββββββ| 11012/11346 [5:33:15<13:44, 2.47s/it]
|
900 |
97%|ββββββββββ| 11013/11346 [5:33:16<12:23, 2.23s/it]
|
901 |
97%|ββββββββββ| 11014/11346 [5:33:18<11:26, 2.07s/it]
|
902 |
97%|ββββββββββ| 11015/11346 [5:33:20<10:46, 1.95s/it]
|
903 |
97%|ββββββββββ| 11016/11346 [5:33:22<10:17, 1.87s/it]
|
904 |
97%|ββββββββββ| 11017/11346 [5:33:23<09:56, 1.81s/it]
|
905 |
97%|ββββββββββ| 11018/11346 [5:33:25<09:42, 1.77s/it]
|
906 |
97%|ββββββββββ| 11019/11346 [5:33:27<09:31, 1.75s/it]
|
907 |
97%|ββββββββββ| 11020/11346 [5:33:28<09:23, 1.73s/it]
|
908 |
97%|ββββββββββ| 11021/11346 [5:33:30<09:18, 1.72s/it]
|
909 |
97%|ββββββββββ| 11022/11346 [5:33:32<09:13, 1.71s/it]
|
910 |
97%|ββββββββββ| 11023/11346 [5:33:33<09:09, 1.70s/it]
|
911 |
97%|ββββββββββ| 11024/11346 [5:33:35<09:05, 1.70s/it]
|
912 |
97%|ββββββββββ| 11025/11346 [5:33:37<09:03, 1.69s/it]
|
913 |
97%|ββββββββββ| 11026/11346 [5:33:38<09:00, 1.69s/it]
|
914 |
97%|ββββββββββ| 11027/11346 [5:33:40<08:58, 1.69s/it]
|
915 |
97%|ββββββββββ| 11028/11346 [5:33:42<08:55, 1.68s/it]
|
916 |
97%|ββββββββββ| 11029/11346 [5:33:43<08:54, 1.69s/it]
|
917 |
97%|ββββββββββ| 11030/11346 [5:33:45<08:52, 1.69s/it]
|
918 |
97%|ββββββββββ| 11031/11346 [5:33:47<08:50, 1.68s/it]
|
919 |
97%|ββββββββββ| 11032/11346 [5:33:48<08:48, 1.68s/it]
|
920 |
97%|ββββββββββ| 11033/11346 [5:33:50<08:46, 1.68s/it]
|
921 |
97%|ββββββββββ| 11034/11346 [5:33:52<08:44, 1.68s/it]
|
922 |
97%|ββββββββββ| 11035/11346 [5:33:54<08:43, 1.68s/it]
|
923 |
97%|ββββββββββ| 11036/11346 [5:33:55<08:41, 1.68s/it]
|
924 |
97%|ββββββββββ| 11037/11346 [5:33:57<08:40, 1.68s/it]
|
925 |
97%|ββββββββββ| 11038/11346 [5:33:59<08:38, 1.68s/it]
|
926 |
97%|ββββββββββ| 11039/11346 [5:34:00<08:37, 1.69s/it]
|
927 |
97%|ββββββββββ| 11040/11346 [5:34:02<08:35, 1.68s/it]
|
928 |
97%|ββββββββββ| 11041/11346 [5:34:04<08:33, 1.68s/it]
|
929 |
97%|ββββββββββ| 11042/11346 [5:34:05<08:31, 1.68s/it]
|
930 |
97%|ββββββββββ| 11043/11346 [5:34:07<08:30, 1.69s/it]
|
931 |
97%|ββββββββββ| 11044/11346 [5:34:09<08:28, 1.68s/it]
|
932 |
97%|ββββββββββ| 11045/11346 [5:34:10<08:27, 1.68s/it]
|
933 |
97%|ββββββββββ| 11046/11346 [5:34:12<08:25, 1.69s/it]
|
934 |
97%|ββββββββββ| 11047/11346 [5:34:14<08:23, 1.69s/it]
|
935 |
97%|ββββββββββ| 11048/11346 [5:34:15<08:22, 1.68s/it]
|
936 |
97%|ββββββββββ| 11049/11346 [5:34:17<08:20, 1.68s/it]
|
937 |
97%|ββββββββββ| 11050/11346 [5:34:19<08:18, 1.68s/it]
|
938 |
97%|ββββββββββ| 11051/11346 [5:34:20<08:16, 1.68s/it]
|
939 |
97%|ββββββββββ| 11052/11346 [5:34:22<08:15, 1.69s/it]
|
940 |
97%|ββββββββββ| 11053/11346 [5:34:24<08:13, 1.69s/it]
|
941 |
97%|ββββββββββ| 11054/11346 [5:34:26<08:12, 1.69s/it]
|
942 |
97%|ββββββββββ| 11055/11346 [5:34:27<08:10, 1.68s/it]
|
943 |
97%|ββββββββββ| 11056/11346 [5:34:29<08:08, 1.68s/it]
|
944 |
97%|ββββββββββ| 11057/11346 [5:34:31<08:06, 1.68s/it]
|
945 |
97%|ββββββββββ| 11058/11346 [5:34:32<08:04, 1.68s/it]
|
946 |
97%|ββββββββββ| 11059/11346 [5:34:34<08:03, 1.68s/it]
|
947 |
97%|ββββββββββ| 11060/11346 [5:34:36<08:01, 1.69s/it]
|
948 |
97%|ββββββββββ| 11061/11346 [5:34:37<08:00, 1.68s/it]
|
949 |
97%|ββββββββββ| 11062/11346 [5:34:39<07:58, 1.69s/it]
|
950 |
98%|ββββββββββ| 11063/11346 [5:34:41<07:56, 1.69s/it]
|
951 |
98%|ββββββββββ| 11064/11346 [5:34:42<07:55, 1.69s/it]
|
952 |
98%|ββββββββββ| 11065/11346 [5:34:44<07:53, 1.69s/it]
|
953 |
98%|ββββββββββ| 11066/11346 [5:34:46<07:51, 1.69s/it]
|
954 |
98%|ββββββββββ| 11067/11346 [5:34:47<07:49, 1.68s/it]
|
955 |
98%|ββββββββββ| 11068/11346 [5:34:49<07:48, 1.68s/it]
|
956 |
98%|ββββββββββ| 11069/11346 [5:34:51<07:46, 1.68s/it]
|
957 |
98%|ββββββββββ| 11070/11346 [5:34:52<07:44, 1.68s/it]
|
958 |
98%|ββββββββββ| 11071/11346 [5:34:54<07:42, 1.68s/it]
|
959 |
98%|ββββββββββ| 11072/11346 [5:34:56<07:40, 1.68s/it]
|
960 |
98%|ββββββββββ| 11073/11346 [5:34:58<07:39, 1.68s/it]
|
961 |
98%|ββββββββββ| 11074/11346 [5:34:59<07:37, 1.68s/it]
|
962 |
98%|ββββββββββ| 11075/11346 [5:35:01<07:36, 1.68s/it]
|
963 |
98%|ββββββββββ| 11076/11346 [5:35:03<07:35, 1.69s/it]
|
964 |
98%|ββββββββββ| 11077/11346 [5:35:04<07:33, 1.68s/it]
|
965 |
98%|ββββββββββ| 11078/11346 [5:35:06<07:31, 1.69s/it]
|
966 |
98%|ββββββββββ| 11079/11346 [5:35:08<07:29, 1.68s/it]
|
967 |
98%|ββββββββββ| 11080/11346 [5:35:09<07:28, 1.68s/it]
|
968 |
98%|ββββββββββ| 11081/11346 [5:35:11<07:26, 1.69s/it]
|
969 |
98%|ββββββββββ| 11082/11346 [5:35:13<07:24, 1.69s/it]
|
970 |
98%|ββββββββββ| 11083/11346 [5:35:14<07:23, 1.69s/it]
|
971 |
98%|ββββββββββ| 11084/11346 [5:35:16<07:21, 1.69s/it]
|
972 |
98%|ββββββββββ| 11085/11346 [5:35:18<07:19, 1.69s/it]
|
973 |
98%|ββββββββββ| 11086/11346 [5:35:19<07:18, 1.68s/it]
|
974 |
98%|ββββββββββ| 11087/11346 [5:35:21<07:16, 1.68s/it]
|
975 |
98%|ββββββββββ| 11088/11346 [5:35:23<07:14, 1.68s/it]
|
976 |
98%|ββββββββββ| 11089/11346 [5:35:24<07:12, 1.68s/it]
|
977 |
98%|ββββββββββ| 11090/11346 [5:35:26<07:11, 1.68s/it]
|
978 |
98%|ββββββββββ| 11091/11346 [5:35:28<07:09, 1.68s/it]
|
979 |
98%|ββββββββββ| 11092/11346 [5:35:30<07:07, 1.68s/it]
|
980 |
98%|ββββββββββ| 11093/11346 [5:35:31<07:05, 1.68s/it]
|
981 |
98%|ββββββββββ| 11094/11346 [5:35:33<07:03, 1.68s/it]
|
982 |
98%|ββββββββββ| 11095/11346 [5:35:35<07:02, 1.68s/it]
|
983 |
98%|ββββββββββ| 11096/11346 [5:35:36<07:01, 1.68s/it]
|
984 |
98%|ββββββββββ| 11097/11346 [5:35:38<06:59, 1.68s/it]
|
985 |
98%|ββββββββββ| 11098/11346 [5:35:40<06:57, 1.69s/it]
|
986 |
98%|ββββββββββ| 11099/11346 [5:35:41<06:56, 1.68s/it]
|
987 |
98%|ββββββββββ| 11100/11346 [5:35:43<06:54, 1.68s/it]
|
988 |
98%|ββββββββββ| 11101/11346 [5:35:45<06:52, 1.68s/it]
|
989 |
98%|ββββββββββ| 11102/11346 [5:35:46<06:50, 1.68s/it]
|
990 |
98%|ββββββββββ| 11103/11346 [5:35:48<06:49, 1.68s/it]
|
991 |
98%|ββββββββββ| 11104/11346 [5:35:50<07:04, 1.76s/it]
|
992 |
98%|ββββββββββ| 11105/11346 [5:35:52<06:57, 1.73s/it]
|
993 |
98%|ββββββββββ| 11106/11346 [5:35:53<06:52, 1.72s/it]
|
994 |
98%|ββββββββββ| 11107/11346 [5:35:55<06:48, 1.71s/it]
|
995 |
98%|ββββββββββ| 11108/11346 [5:35:57<06:45, 1.70s/it]
|
996 |
98%|ββββββββββ| 11109/11346 [5:35:58<06:41, 1.70s/it]
|
997 |
98%|ββββββββββ| 11110/11346 [5:36:00<06:39, 1.69s/it]
|
998 |
98%|ββββββββββ| 11111/11346 [5:36:02<06:36, 1.69s/it]
|
999 |
98%|ββββββββββ| 11112/11346 [5:36:03<06:35, 1.69s/it]
|
1000 |
98%|ββββββββββ| 11113/11346 [5:36:05<06:33, 1.69s/it]
|
1001 |
98%|ββββββββββ| 11114/11346 [5:36:07<06:31, 1.69s/it]
|
1002 |
98%|ββββββββββ| 11115/11346 [5:36:09<06:29, 1.69s/it]
|
1003 |
98%|ββββββββββ| 11116/11346 [5:36:10<06:27, 1.69s/it]
|
1004 |
98%|ββββββββββ| 11117/11346 [5:36:12<06:25, 1.68s/it]
|
1005 |
98%|ββββββββββ| 11118/11346 [5:36:14<06:24, 1.68s/it]
|
1006 |
98%|ββββββββββ| 11119/11346 [5:36:15<06:22, 1.68s/it]
|
1007 |
98%|ββββββββββ| 11120/11346 [5:36:17<06:20, 1.68s/it]
|
1008 |
98%|ββββββββββ| 11121/11346 [5:36:19<06:19, 1.69s/it]
|
1009 |
98%|ββββββββββ| 11122/11346 [5:36:20<06:17, 1.69s/it]
|
1010 |
98%|ββββββββββ| 11123/11346 [5:36:22<06:16, 1.69s/it]
|
1011 |
98%|ββββββββββ| 11124/11346 [5:36:24<06:14, 1.69s/it]
|
1012 |
98%|ββββββββββ| 11125/11346 [5:36:25<06:12, 1.69s/it]
|
1013 |
98%|ββββββββββ| 11126/11346 [5:36:27<06:10, 1.68s/it]
|
1014 |
98%|ββββββββββ| 11127/11346 [5:36:29<06:09, 1.68s/it]
|
1015 |
98%|ββββββββββ| 11128/11346 [5:36:30<06:07, 1.68s/it]
|
1016 |
98%|ββββββββββ| 11129/11346 [5:36:32<06:05, 1.68s/it]
|
1017 |
98%|ββββββββββ| 11130/11346 [5:36:34<06:03, 1.68s/it]
|
1018 |
98%|ββββββββββ| 11131/11346 [5:36:35<06:01, 1.68s/it]
|
1019 |
98%|ββββββββββ| 11132/11346 [5:36:37<06:00, 1.68s/it]
|
1020 |
98%|ββββββββββ| 11133/11346 [5:36:39<05:58, 1.68s/it]
|
1021 |
98%|ββββββββββ| 11134/11346 [5:36:41<05:56, 1.68s/it]
|
1022 |
98%|ββββββββββ| 11135/11346 [5:36:42<05:55, 1.68s/it]
|
1023 |
98%|ββββββββββ| 11136/11346 [5:36:44<05:53, 1.68s/it]
|
1024 |
98%|ββββββββββ| 11137/11346 [5:36:46<05:51, 1.68s/it]
|
1025 |
98%|ββββββββββ| 11138/11346 [5:36:47<05:50, 1.68s/it]
|
1026 |
98%|ββββββββββ| 11139/11346 [5:36:49<05:48, 1.68s/it]
|
1027 |
98%|ββββββββββ| 11140/11346 [5:36:51<05:47, 1.68s/it]
|
1028 |
98%|ββββββββββ| 11141/11346 [5:36:52<05:45, 1.68s/it]
|
1029 |
98%|ββββββββββ| 11142/11346 [5:36:54<05:43, 1.68s/it]
|
1030 |
98%|ββββββββββ| 11143/11346 [5:36:56<05:41, 1.68s/it]
|
1031 |
98%|ββββββββββ| 11144/11346 [5:36:57<05:40, 1.69s/it]
|
1032 |
98%|ββββββββββ| 11145/11346 [5:36:59<05:38, 1.68s/it]
|
1033 |
98%|ββββββββββ| 11146/11346 [5:37:01<05:37, 1.69s/it]
|
1034 |
98%|ββββββββββ| 11147/11346 [5:37:02<05:35, 1.68s/it]
|
1035 |
98%|ββββββββββ| 11148/11346 [5:37:04<05:33, 1.68s/it]
|
1036 |
98%|ββββββββββ| 11149/11346 [5:37:06<05:31, 1.68s/it]
|
1037 |
98%|ββββββββββ| 11150/11346 [5:37:07<05:30, 1.68s/it]
|
1038 |
98%|ββββββββββ| 11151/11346 [5:37:09<05:28, 1.68s/it]
|
1039 |
98%|ββββββββββ| 11152/11346 [5:37:11<05:26, 1.68s/it]
|
1040 |
98%|ββββββββββ| 11153/11346 [5:37:13<05:24, 1.68s/it]
|
1041 |
98%|ββββββββββ| 11154/11346 [5:37:14<05:22, 1.68s/it]
|
1042 |
98%|ββββββββββ| 11155/11346 [5:37:16<05:21, 1.68s/it]
|
1043 |
98%|ββββββββββ| 11156/11346 [5:37:18<05:19, 1.68s/it]
|
1044 |
98%|ββββββββββ| 11157/11346 [5:37:19<05:17, 1.68s/it]
|
1045 |
98%|ββββββββββ| 11158/11346 [5:37:21<05:16, 1.68s/it]
|
1046 |
98%|ββββββββββ| 11159/11346 [5:37:23<05:14, 1.68s/it]
|
1047 |
98%|ββββββββββ| 11160/11346 [5:37:24<05:12, 1.68s/it]
|
1048 |
98%|ββββββββββ| 11161/11346 [5:37:26<05:11, 1.68s/it]
|
1049 |
98%|ββββββββββ| 11162/11346 [5:37:28<05:09, 1.68s/it]
|
1050 |
98%|ββββββββββ| 11163/11346 [5:37:29<05:07, 1.68s/it]
|
1051 |
98%|ββββββββββ| 11164/11346 [5:37:31<05:06, 1.68s/it]
|
1052 |
98%|ββββββββββ| 11165/11346 [5:37:33<05:04, 1.68s/it]
|
1053 |
98%|ββββββββββ| 11166/11346 [5:37:34<05:03, 1.68s/it]
|
1054 |
98%|ββββββββββ| 11167/11346 [5:37:36<05:01, 1.68s/it]
|
1055 |
98%|ββββββββββ| 11168/11346 [5:37:38<04:59, 1.68s/it]
|
1056 |
98%|ββββββββββ| 11169/11346 [5:37:39<04:58, 1.69s/it]
|
1057 |
98%|ββββββββββ| 11170/11346 [5:37:41<04:56, 1.69s/it]
|
1058 |
98%|ββββββββββ| 11171/11346 [5:37:43<04:54, 1.68s/it]
|
1059 |
98%|ββββββββββ| 11172/11346 [5:37:44<04:53, 1.68s/it]
|
1060 |
98%|ββββββββββ| 11173/11346 [5:37:46<04:51, 1.68s/it]
|
1061 |
98%|ββββββββββ| 11174/11346 [5:37:48<04:49, 1.68s/it]
|
1062 |
98%|ββββββββββ| 11175/11346 [5:37:50<04:47, 1.68s/it]
|
1063 |
99%|ββββββββββ| 11176/11346 [5:37:51<04:46, 1.68s/it]
|
1064 |
99%|ββββββββββ| 11177/11346 [5:37:53<04:44, 1.68s/it]
|
1065 |
99%|ββββββββββ| 11178/11346 [5:37:55<04:42, 1.68s/it]
|
1066 |
99%|ββββββββββ| 11179/11346 [5:37:56<04:41, 1.68s/it]
|
1067 |
99%|ββββββββββ| 11180/11346 [5:37:58<04:39, 1.68s/it]
|
1068 |
99%|ββββββββββ| 11181/11346 [5:38:00<04:37, 1.68s/it]
|
1069 |
99%|ββββββββββ| 11182/11346 [5:38:01<04:36, 1.68s/it]
|
1070 |
99%|ββββββββββ| 11183/11346 [5:38:03<04:34, 1.68s/it]
|
1071 |
99%|ββββββββββ| 11184/11346 [5:38:05<04:32, 1.68s/it]
|
1072 |
99%|ββββββββββ| 11185/11346 [5:38:06<04:31, 1.68s/it]
|
1073 |
99%|ββββββββββ| 11186/11346 [5:38:08<04:29, 1.68s/it]
|
1074 |
99%|ββββββββββ| 11187/11346 [5:38:10<04:27, 1.68s/it]
|
1075 |
99%|ββββββββββ| 11188/11346 [5:38:11<04:26, 1.68s/it]
|
1076 |
99%|ββββββββββ| 11189/11346 [5:38:13<04:24, 1.68s/it]
|
1077 |
99%|ββββββββββ| 11190/11346 [5:38:15<04:22, 1.68s/it]
|
1078 |
99%|ββββββββββ| 11191/11346 [5:38:16<04:21, 1.68s/it]
|
1079 |
99%|ββββββββββ| 11192/11346 [5:38:18<04:19, 1.68s/it]
|
1080 |
99%|ββββββββββ| 11193/11346 [5:38:20<04:17, 1.68s/it]
|
1081 |
99%|ββββββββββ| 11194/11346 [5:38:22<04:16, 1.68s/it]
|
1082 |
99%|ββββββββββ| 11195/11346 [5:38:23<04:14, 1.68s/it]
|
1083 |
99%|ββββββββββ| 11196/11346 [5:38:25<04:12, 1.68s/it]
|
1084 |
99%|ββββββββββ| 11197/11346 [5:38:27<04:10, 1.68s/it]
|
1085 |
99%|ββββββββββ| 11198/11346 [5:38:28<04:09, 1.68s/it]
|
1086 |
99%|ββββββββββ| 11199/11346 [5:38:30<04:07, 1.68s/it]
|
1087 |
99%|ββββββββββ| 11200/11346 [5:38:32<04:05, 1.68s/it]
|
1088 |
99%|ββββββββββ| 11201/11346 [5:38:33<04:03, 1.68s/it]
|
1089 |
99%|ββββββββββ| 11202/11346 [5:38:35<04:02, 1.68s/it]
|
1090 |
99%|ββββββββββ| 11203/11346 [5:38:37<04:00, 1.68s/it]
|
1091 |
99%|ββββββββββ| 11204/11346 [5:38:38<03:59, 1.68s/it]
|
1092 |
99%|ββββββββββ| 11205/11346 [5:38:40<03:57, 1.68s/it]
|
1093 |
99%|ββββββββββ| 11206/11346 [5:38:42<03:55, 1.68s/it]
|
1094 |
99%|ββββββββββ| 11207/11346 [5:38:43<03:54, 1.68s/it]
|
1095 |
99%|ββββββββββ| 11208/11346 [5:38:45<03:52, 1.68s/it]
|
1096 |
99%|ββββββββββ| 11209/11346 [5:38:47<03:50, 1.68s/it]
|
1097 |
99%|ββββββββββ| 11210/11346 [5:38:48<03:49, 1.68s/it]
|
1098 |
99%|ββββββββββ| 11211/11346 [5:38:50<03:47, 1.68s/it]
|
1099 |
99%|ββββββββββ| 11212/11346 [5:38:52<03:45, 1.68s/it]
|
1100 |
99%|ββββββββββ| 11213/11346 [5:38:54<03:43, 1.68s/it]
|
1101 |
99%|ββββββββββ| 11214/11346 [5:38:55<03:42, 1.69s/it]
|
1102 |
99%|ββββββββββ| 11215/11346 [5:38:57<03:40, 1.68s/it]
|
1103 |
99%|ββββββββββ| 11216/11346 [5:38:59<03:39, 1.69s/it]
|
1104 |
99%|ββββββββββ| 11217/11346 [5:39:00<03:37, 1.69s/it]
|
1105 |
99%|ββββββββββ| 11218/11346 [5:39:02<03:35, 1.68s/it]
|
1106 |
99%|ββββββββββ| 11219/11346 [5:39:04<03:33, 1.68s/it]
|
1107 |
99%|ββββββββββ| 11220/11346 [5:39:05<03:32, 1.69s/it]
|
1108 |
99%|ββββββββββ| 11221/11346 [5:39:07<03:30, 1.69s/it]
|
1109 |
99%|ββββββββββ| 11222/11346 [5:39:09<03:28, 1.69s/it]
|
1110 |
99%|ββββββββββ| 11223/11346 [5:39:10<03:27, 1.69s/it]
|
1111 |
99%|ββββββββββ| 11224/11346 [5:39:12<03:25, 1.68s/it]
|
1112 |
99%|ββββββββββ| 11225/11346 [5:39:14<03:23, 1.69s/it]
|
1113 |
99%|ββββββββββ| 11226/11346 [5:39:15<03:22, 1.69s/it]
|
1114 |
99%|ββββββββββ| 11227/11346 [5:39:17<03:20, 1.68s/it]
|
1115 |
99%|ββββββββββ| 11228/11346 [5:39:19<03:18, 1.68s/it]
|
1116 |
99%|ββββββββββ| 11229/11346 [5:39:20<03:17, 1.69s/it]
|
1117 |
99%|ββββββββββ| 11230/11346 [5:39:22<03:15, 1.69s/it]
|
1118 |
99%|ββββββββββ| 11231/11346 [5:39:24<03:13, 1.69s/it]
|
1119 |
99%|ββββββββββ| 11232/11346 [5:39:26<03:12, 1.68s/it]
|
1120 |
99%|ββββββββββ| 11233/11346 [5:39:27<03:10, 1.68s/it]
|
1121 |
99%|ββββββββββ| 11234/11346 [5:39:29<03:08, 1.68s/it]
|
1122 |
99%|ββββββββββ| 11235/11346 [5:39:31<03:06, 1.68s/it]
|
1123 |
99%|ββββββββββ| 11236/11346 [5:39:32<03:05, 1.68s/it]
|
1124 |
99%|ββββββββββ| 11237/11346 [5:39:34<03:03, 1.68s/it]
|
1125 |
99%|ββββββββββ| 11238/11346 [5:39:36<03:01, 1.68s/it]
|
1126 |
99%|ββββββββββ| 11239/11346 [5:39:37<03:00, 1.68s/it]
|
1127 |
99%|ββββββββββ| 11240/11346 [5:39:39<02:58, 1.68s/it]
|
1128 |
99%|ββββββββββ| 11241/11346 [5:39:41<02:56, 1.68s/it]
|
1129 |
99%|ββββββββββ| 11242/11346 [5:39:42<02:55, 1.68s/it]
|
1130 |
99%|ββββββββββ| 11243/11346 [5:39:44<02:53, 1.68s/it]
|
1131 |
99%|ββββββββββ| 11244/11346 [5:39:46<02:51, 1.68s/it]
|
1132 |
99%|ββββββββββ| 11245/11346 [5:39:47<02:50, 1.68s/it]
|
1133 |
99%|ββββββββββ| 11246/11346 [5:39:49<02:48, 1.68s/it]
|
1134 |
99%|ββββββββββ| 11247/11346 [5:39:51<02:46, 1.69s/it]
|
1135 |
99%|ββββββββββ| 11248/11346 [5:39:52<02:45, 1.69s/it]
|
1136 |
99%|ββββββββββ| 11249/11346 [5:39:54<02:43, 1.69s/it]
|
1137 |
99%|ββββββββββ| 11250/11346 [5:39:56<02:41, 1.69s/it]
|
1138 |
99%|ββββββββββ| 11251/11346 [5:39:58<02:40, 1.69s/it]
|
1139 |
99%|ββββββββββ| 11252/11346 [5:39:59<02:38, 1.69s/it]
|
1140 |
99%|ββββββββββ| 11253/11346 [5:40:01<02:36, 1.69s/it]
|
1141 |
99%|ββββββββββ| 11254/11346 [5:40:03<02:34, 1.68s/it]
|
1142 |
99%|ββββββββββ| 11255/11346 [5:40:04<02:33, 1.68s/it]
|
1143 |
99%|ββββββββββ| 11256/11346 [5:40:06<02:31, 1.69s/it]
|
1144 |
99%|ββββββββββ| 11257/11346 [5:40:08<02:29, 1.68s/it]
|
1145 |
99%|ββββββββββ| 11258/11346 [5:40:09<02:28, 1.68s/it]
|
1146 |
99%|ββββββββββ| 11259/11346 [5:40:11<02:26, 1.68s/it]
|
1147 |
99%|ββββββββββ| 11260/11346 [5:40:13<02:24, 1.68s/it]
|
1148 |
99%|ββββββββββ| 11261/11346 [5:40:14<02:23, 1.68s/it]
|
1149 |
99%|ββββββββββ| 11262/11346 [5:40:16<02:21, 1.68s/it]
|
1150 |
99%|ββββββββββ| 11263/11346 [5:40:18<02:19, 1.68s/it]
|
1151 |
99%|ββββββββββ| 11264/11346 [5:40:19<02:17, 1.68s/it]
|
1152 |
99%|ββββββββββ| 11265/11346 [5:40:21<02:16, 1.68s/it]
|
1153 |
99%|ββββββββββ| 11266/11346 [5:40:23<02:14, 1.68s/it]
|
1154 |
99%|ββββββββββ| 11267/11346 [5:40:24<02:12, 1.68s/it]
|
1155 |
99%|ββββββββββ| 11268/11346 [5:40:26<02:11, 1.68s/it]
|
1156 |
99%|ββββββββββ| 11269/11346 [5:40:28<02:09, 1.68s/it]
|
1157 |
99%|ββββββββββ| 11270/11346 [5:40:30<02:07, 1.68s/it]
|
1158 |
99%|ββββββββββ| 11271/11346 [5:40:31<02:06, 1.68s/it]
|
1159 |
99%|ββββββββββ| 11272/11346 [5:40:33<02:04, 1.68s/it]
|
1160 |
99%|ββββββββββ| 11273/11346 [5:40:35<02:02, 1.68s/it]
|
1161 |
99%|ββββββββββ| 11274/11346 [5:40:36<02:01, 1.68s/it]
|
1162 |
99%|ββββββββββ| 11275/11346 [5:40:38<01:59, 1.68s/it]
|
1163 |
99%|ββββββββββ| 11276/11346 [5:40:40<01:57, 1.68s/it]
|
1164 |
99%|ββββββββββ| 11277/11346 [5:40:41<01:56, 1.68s/it]
|
1165 |
99%|ββββββββββ| 11278/11346 [5:40:43<01:54, 1.68s/it]
|
1166 |
99%|ββββββββββ| 11279/11346 [5:40:45<01:52, 1.68s/it]
|
1167 |
99%|ββββββββββ| 11280/11346 [5:40:46<01:51, 1.69s/it]
|
1168 |
99%|ββββββββββ| 11281/11346 [5:40:48<01:49, 1.69s/it]
|
1169 |
99%|ββββββββββ| 11282/11346 [5:40:50<01:47, 1.68s/it]
|
1170 |
99%|ββββββββββ| 11283/11346 [5:40:51<01:46, 1.68s/it]
|
1171 |
99%|ββββββββββ| 11284/11346 [5:40:53<01:44, 1.69s/it]
|
1172 |
99%|ββββββββββ| 11285/11346 [5:40:55<01:42, 1.68s/it]
|
1173 |
99%|ββββββββββ| 11286/11346 [5:40:56<01:41, 1.68s/it]
|
1174 |
99%|ββββββββββ| 11287/11346 [5:40:58<01:39, 1.68s/it]
|
1175 |
99%|ββββββββββ| 11288/11346 [5:41:00<01:37, 1.68s/it]
|
1176 |
99%|ββββββββββ| 11289/11346 [5:41:02<01:36, 1.69s/it]
|
|
|
|
|
|
|
|
|
|
|
1177 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
885 |
[INFO|tokenization_utils_base.py:2512] 2024-06-04 06:34:46,021 >> Special tokens file saved in ./training_outputs_job_116987_1_04-06_01-01/special_tokens_map.json
|
886 |
/home/dshteyma/miniconda3/lib/python3.9/site-packages/torch/nn/parallel/_functions.py:68: UserWarning: Was asked to gather along dimension 0, but all input tensors were scalars; will instead unsqueeze and return a vector.
|
887 |
warnings.warn('Was asked to gather along dimension 0, but all '
|
888 |
+
|
889 |
97%|ββββββββββ| 11001/11346 [5:32:56<3:58:05, 41.41s/it]
|
890 |
97%|ββββββββββ| 11002/11346 [5:32:58<2:49:05, 29.49s/it]
|
891 |
97%|ββββββββββ| 11003/11346 [5:33:00<2:00:54, 21.15s/it]
|
892 |
97%|ββββββββββ| 11004/11346 [5:33:01<1:27:16, 15.31s/it]
|
893 |
97%|ββββββββββ| 11005/11346 [5:33:03<1:03:47, 11.22s/it]
|
894 |
97%|ββββββββββ| 11006/11346 [5:33:05<47:22, 8.36s/it]
|
895 |
97%|ββββββββββ| 11007/11346 [5:33:06<35:56, 6.36s/it]
|
896 |
97%|ββββββββββ| 11008/11346 [5:33:08<27:55, 4.96s/it]
|
897 |
97%|ββββββββββ| 11009/11346 [5:33:10<22:20, 3.98s/it]
|
898 |
97%|ββββββββββ| 11010/11346 [5:33:11<18:24, 3.29s/it]
|
899 |
97%|ββββββββββ| 11011/11346 [5:33:13<15:40, 2.81s/it]
|
900 |
97%|ββββββββββ| 11012/11346 [5:33:15<13:44, 2.47s/it]
|
901 |
97%|ββββββββββ| 11013/11346 [5:33:16<12:23, 2.23s/it]
|
902 |
97%|ββββββββββ| 11014/11346 [5:33:18<11:26, 2.07s/it]
|
903 |
97%|ββββββββββ| 11015/11346 [5:33:20<10:46, 1.95s/it]
|
904 |
97%|ββββββββββ| 11016/11346 [5:33:22<10:17, 1.87s/it]
|
905 |
97%|ββββββββββ| 11017/11346 [5:33:23<09:56, 1.81s/it]
|
906 |
97%|ββββββββββ| 11018/11346 [5:33:25<09:42, 1.77s/it]
|
907 |
97%|ββββββββββ| 11019/11346 [5:33:27<09:31, 1.75s/it]
|
908 |
97%|ββββββββββ| 11020/11346 [5:33:28<09:23, 1.73s/it]
|
909 |
97%|ββββββββββ| 11021/11346 [5:33:30<09:18, 1.72s/it]
|
910 |
97%|ββββββββββ| 11022/11346 [5:33:32<09:13, 1.71s/it]
|
911 |
97%|ββββββββββ| 11023/11346 [5:33:33<09:09, 1.70s/it]
|
912 |
97%|ββββββββββ| 11024/11346 [5:33:35<09:05, 1.70s/it]
|
913 |
97%|ββββββββββ| 11025/11346 [5:33:37<09:03, 1.69s/it]
|
914 |
97%|ββββββββββ| 11026/11346 [5:33:38<09:00, 1.69s/it]
|
915 |
97%|ββββββββββ| 11027/11346 [5:33:40<08:58, 1.69s/it]
|
916 |
97%|ββββββββββ| 11028/11346 [5:33:42<08:55, 1.68s/it]
|
917 |
97%|ββββββββββ| 11029/11346 [5:33:43<08:54, 1.69s/it]
|
918 |
97%|ββββββββββ| 11030/11346 [5:33:45<08:52, 1.69s/it]
|
919 |
97%|ββββββββββ| 11031/11346 [5:33:47<08:50, 1.68s/it]
|
920 |
97%|ββββββββββ| 11032/11346 [5:33:48<08:48, 1.68s/it]
|
921 |
97%|ββββββββββ| 11033/11346 [5:33:50<08:46, 1.68s/it]
|
922 |
97%|ββββββββββ| 11034/11346 [5:33:52<08:44, 1.68s/it]
|
923 |
97%|ββββββββββ| 11035/11346 [5:33:54<08:43, 1.68s/it]
|
924 |
97%|ββββββββββ| 11036/11346 [5:33:55<08:41, 1.68s/it]
|
925 |
97%|ββββββββββ| 11037/11346 [5:33:57<08:40, 1.68s/it]
|
926 |
97%|ββββββββββ| 11038/11346 [5:33:59<08:38, 1.68s/it]
|
927 |
97%|ββββββββββ| 11039/11346 [5:34:00<08:37, 1.69s/it]
|
928 |
97%|ββββββββββ| 11040/11346 [5:34:02<08:35, 1.68s/it]
|
929 |
97%|ββββββββββ| 11041/11346 [5:34:04<08:33, 1.68s/it]
|
930 |
97%|ββββββββββ| 11042/11346 [5:34:05<08:31, 1.68s/it]
|
931 |
97%|ββββββββββ| 11043/11346 [5:34:07<08:30, 1.69s/it]
|
932 |
97%|ββββββββββ| 11044/11346 [5:34:09<08:28, 1.68s/it]
|
933 |
97%|ββββββββββ| 11045/11346 [5:34:10<08:27, 1.68s/it]
|
934 |
97%|ββββββββββ| 11046/11346 [5:34:12<08:25, 1.69s/it]
|
935 |
97%|ββββββββββ| 11047/11346 [5:34:14<08:23, 1.69s/it]
|
936 |
97%|ββββββββββ| 11048/11346 [5:34:15<08:22, 1.68s/it]
|
937 |
97%|ββββββββββ| 11049/11346 [5:34:17<08:20, 1.68s/it]
|
938 |
97%|ββββββββββ| 11050/11346 [5:34:19<08:18, 1.68s/it]
|
939 |
97%|ββββββββββ| 11051/11346 [5:34:20<08:16, 1.68s/it]
|
940 |
97%|ββββββββββ| 11052/11346 [5:34:22<08:15, 1.69s/it]
|
941 |
97%|ββββββββββ| 11053/11346 [5:34:24<08:13, 1.69s/it]
|
942 |
97%|ββββββββββ| 11054/11346 [5:34:26<08:12, 1.69s/it]
|
943 |
97%|ββββββββββ| 11055/11346 [5:34:27<08:10, 1.68s/it]
|
944 |
97%|ββββββββββ| 11056/11346 [5:34:29<08:08, 1.68s/it]
|
945 |
97%|ββββββββββ| 11057/11346 [5:34:31<08:06, 1.68s/it]
|
946 |
97%|ββββββββββ| 11058/11346 [5:34:32<08:04, 1.68s/it]
|
947 |
97%|ββββββββββ| 11059/11346 [5:34:34<08:03, 1.68s/it]
|
948 |
97%|ββββββββββ| 11060/11346 [5:34:36<08:01, 1.69s/it]
|
949 |
97%|ββββββββββ| 11061/11346 [5:34:37<08:00, 1.68s/it]
|
950 |
97%|ββββββββββ| 11062/11346 [5:34:39<07:58, 1.69s/it]
|
951 |
98%|ββββββββββ| 11063/11346 [5:34:41<07:56, 1.69s/it]
|
952 |
98%|ββββββββββ| 11064/11346 [5:34:42<07:55, 1.69s/it]
|
953 |
98%|ββββββββββ| 11065/11346 [5:34:44<07:53, 1.69s/it]
|
954 |
98%|ββββββββββ| 11066/11346 [5:34:46<07:51, 1.69s/it]
|
955 |
98%|ββββββββββ| 11067/11346 [5:34:47<07:49, 1.68s/it]
|
956 |
98%|ββββββββββ| 11068/11346 [5:34:49<07:48, 1.68s/it]
|
957 |
98%|ββββββββββ| 11069/11346 [5:34:51<07:46, 1.68s/it]
|
958 |
98%|ββββββββββ| 11070/11346 [5:34:52<07:44, 1.68s/it]
|
959 |
98%|ββββββββββ| 11071/11346 [5:34:54<07:42, 1.68s/it]
|
960 |
98%|ββββββββββ| 11072/11346 [5:34:56<07:40, 1.68s/it]
|
961 |
98%|ββββββββββ| 11073/11346 [5:34:58<07:39, 1.68s/it]
|
962 |
98%|ββββββββββ| 11074/11346 [5:34:59<07:37, 1.68s/it]
|
963 |
98%|ββββββββββ| 11075/11346 [5:35:01<07:36, 1.68s/it]
|
964 |
98%|ββββββββββ| 11076/11346 [5:35:03<07:35, 1.69s/it]
|
965 |
98%|ββββββββββ| 11077/11346 [5:35:04<07:33, 1.68s/it]
|
966 |
98%|ββββββββββ| 11078/11346 [5:35:06<07:31, 1.69s/it]
|
967 |
98%|ββββββββββ| 11079/11346 [5:35:08<07:29, 1.68s/it]
|
968 |
98%|ββββββββββ| 11080/11346 [5:35:09<07:28, 1.68s/it]
|
969 |
98%|ββββββββββ| 11081/11346 [5:35:11<07:26, 1.69s/it]
|
970 |
98%|ββββββββββ| 11082/11346 [5:35:13<07:24, 1.69s/it]
|
971 |
98%|ββββββββββ| 11083/11346 [5:35:14<07:23, 1.69s/it]
|
972 |
98%|ββββββββββ| 11084/11346 [5:35:16<07:21, 1.69s/it]
|
973 |
98%|ββββββββββ| 11085/11346 [5:35:18<07:19, 1.69s/it]
|
974 |
98%|ββββββββββ| 11086/11346 [5:35:19<07:18, 1.68s/it]
|
975 |
98%|ββββββββββ| 11087/11346 [5:35:21<07:16, 1.68s/it]
|
976 |
98%|ββββββββββ| 11088/11346 [5:35:23<07:14, 1.68s/it]
|
977 |
98%|ββββββββββ| 11089/11346 [5:35:24<07:12, 1.68s/it]
|
978 |
98%|ββββββββββ| 11090/11346 [5:35:26<07:11, 1.68s/it]
|
979 |
98%|ββββββββββ| 11091/11346 [5:35:28<07:09, 1.68s/it]
|
980 |
98%|ββββββββββ| 11092/11346 [5:35:30<07:07, 1.68s/it]
|
981 |
98%|ββββββββββ| 11093/11346 [5:35:31<07:05, 1.68s/it]
|
982 |
98%|ββββββββββ| 11094/11346 [5:35:33<07:03, 1.68s/it]
|
983 |
98%|ββββββββββ| 11095/11346 [5:35:35<07:02, 1.68s/it]
|
984 |
98%|ββββββββββ| 11096/11346 [5:35:36<07:01, 1.68s/it]
|
985 |
98%|ββββββββββ| 11097/11346 [5:35:38<06:59, 1.68s/it]
|
986 |
98%|ββββββββββ| 11098/11346 [5:35:40<06:57, 1.69s/it]
|
987 |
98%|ββββββββββ| 11099/11346 [5:35:41<06:56, 1.68s/it]
|
988 |
98%|ββββββββββ| 11100/11346 [5:35:43<06:54, 1.68s/it]
|
989 |
98%|ββββββββββ| 11101/11346 [5:35:45<06:52, 1.68s/it]
|
990 |
98%|ββββββββββ| 11102/11346 [5:35:46<06:50, 1.68s/it]
|
991 |
98%|ββββββββββ| 11103/11346 [5:35:48<06:49, 1.68s/it]
|
992 |
98%|ββββββββββ| 11104/11346 [5:35:50<07:04, 1.76s/it]
|
993 |
98%|ββββββββββ| 11105/11346 [5:35:52<06:57, 1.73s/it]
|
994 |
98%|ββββββββββ| 11106/11346 [5:35:53<06:52, 1.72s/it]
|
995 |
98%|ββββββββββ| 11107/11346 [5:35:55<06:48, 1.71s/it]
|
996 |
98%|ββββββββββ| 11108/11346 [5:35:57<06:45, 1.70s/it]
|
997 |
98%|ββββββββββ| 11109/11346 [5:35:58<06:41, 1.70s/it]
|
998 |
98%|ββββββββββ| 11110/11346 [5:36:00<06:39, 1.69s/it]
|
999 |
98%|ββββββββββ| 11111/11346 [5:36:02<06:36, 1.69s/it]
|
1000 |
98%|ββββββββββ| 11112/11346 [5:36:03<06:35, 1.69s/it]
|
1001 |
98%|ββββββββββ| 11113/11346 [5:36:05<06:33, 1.69s/it]
|
1002 |
98%|ββββββββββ| 11114/11346 [5:36:07<06:31, 1.69s/it]
|
1003 |
98%|ββββββββββ| 11115/11346 [5:36:09<06:29, 1.69s/it]
|
1004 |
98%|ββββββββββ| 11116/11346 [5:36:10<06:27, 1.69s/it]
|
1005 |
98%|ββββββββββ| 11117/11346 [5:36:12<06:25, 1.68s/it]
|
1006 |
98%|ββββββββββ| 11118/11346 [5:36:14<06:24, 1.68s/it]
|
1007 |
98%|ββββββββββ| 11119/11346 [5:36:15<06:22, 1.68s/it]
|
1008 |
98%|ββββββββββ| 11120/11346 [5:36:17<06:20, 1.68s/it]
|
1009 |
98%|ββββββββββ| 11121/11346 [5:36:19<06:19, 1.69s/it]
|
1010 |
98%|ββββββββββ| 11122/11346 [5:36:20<06:17, 1.69s/it]
|
1011 |
98%|ββββββββββ| 11123/11346 [5:36:22<06:16, 1.69s/it]
|
1012 |
98%|ββββββββββ| 11124/11346 [5:36:24<06:14, 1.69s/it]
|
1013 |
98%|ββββββββββ| 11125/11346 [5:36:25<06:12, 1.69s/it]
|
1014 |
98%|ββββββββββ| 11126/11346 [5:36:27<06:10, 1.68s/it]
|
1015 |
98%|ββββββββββ| 11127/11346 [5:36:29<06:09, 1.68s/it]
|
1016 |
98%|ββββββββββ| 11128/11346 [5:36:30<06:07, 1.68s/it]
|
1017 |
98%|ββββββββββ| 11129/11346 [5:36:32<06:05, 1.68s/it]
|
1018 |
98%|ββββββββββ| 11130/11346 [5:36:34<06:03, 1.68s/it]
|
1019 |
98%|ββββββββββ| 11131/11346 [5:36:35<06:01, 1.68s/it]
|
1020 |
98%|ββββββββββ| 11132/11346 [5:36:37<06:00, 1.68s/it]
|
1021 |
98%|ββββββββββ| 11133/11346 [5:36:39<05:58, 1.68s/it]
|
1022 |
98%|ββββββββββ| 11134/11346 [5:36:41<05:56, 1.68s/it]
|
1023 |
98%|ββββββββββ| 11135/11346 [5:36:42<05:55, 1.68s/it]
|
1024 |
98%|ββββββββββ| 11136/11346 [5:36:44<05:53, 1.68s/it]
|
1025 |
98%|ββββββββββ| 11137/11346 [5:36:46<05:51, 1.68s/it]
|
1026 |
98%|ββββββββββ| 11138/11346 [5:36:47<05:50, 1.68s/it]
|
1027 |
98%|ββββββββββ| 11139/11346 [5:36:49<05:48, 1.68s/it]
|
1028 |
98%|ββββββββββ| 11140/11346 [5:36:51<05:47, 1.68s/it]
|
1029 |
98%|ββββββββββ| 11141/11346 [5:36:52<05:45, 1.68s/it]
|
1030 |
98%|ββββββββββ| 11142/11346 [5:36:54<05:43, 1.68s/it]
|
1031 |
98%|ββββββββββ| 11143/11346 [5:36:56<05:41, 1.68s/it]
|
1032 |
98%|ββββββββββ| 11144/11346 [5:36:57<05:40, 1.69s/it]
|
1033 |
98%|ββββββββββ| 11145/11346 [5:36:59<05:38, 1.68s/it]
|
1034 |
98%|ββββββββββ| 11146/11346 [5:37:01<05:37, 1.69s/it]
|
1035 |
98%|ββββββββββ| 11147/11346 [5:37:02<05:35, 1.68s/it]
|
1036 |
98%|ββββββββββ| 11148/11346 [5:37:04<05:33, 1.68s/it]
|
1037 |
98%|ββββββββββ| 11149/11346 [5:37:06<05:31, 1.68s/it]
|
1038 |
98%|ββββββββββ| 11150/11346 [5:37:07<05:30, 1.68s/it]
|
1039 |
98%|ββββββββββ| 11151/11346 [5:37:09<05:28, 1.68s/it]
|
1040 |
98%|ββββββββββ| 11152/11346 [5:37:11<05:26, 1.68s/it]
|
1041 |
98%|ββββββββββ| 11153/11346 [5:37:13<05:24, 1.68s/it]
|
1042 |
98%|ββββββββββ| 11154/11346 [5:37:14<05:22, 1.68s/it]
|
1043 |
98%|ββββββββββ| 11155/11346 [5:37:16<05:21, 1.68s/it]
|
1044 |
98%|ββββββββββ| 11156/11346 [5:37:18<05:19, 1.68s/it]
|
1045 |
98%|ββββββββββ| 11157/11346 [5:37:19<05:17, 1.68s/it]
|
1046 |
98%|ββββββββββ| 11158/11346 [5:37:21<05:16, 1.68s/it]
|
1047 |
98%|ββββββββββ| 11159/11346 [5:37:23<05:14, 1.68s/it]
|
1048 |
98%|ββββββββββ| 11160/11346 [5:37:24<05:12, 1.68s/it]
|
1049 |
98%|ββββββββββ| 11161/11346 [5:37:26<05:11, 1.68s/it]
|
1050 |
98%|ββββββββββ| 11162/11346 [5:37:28<05:09, 1.68s/it]
|
1051 |
98%|ββββββββββ| 11163/11346 [5:37:29<05:07, 1.68s/it]
|
1052 |
98%|ββββββββββ| 11164/11346 [5:37:31<05:06, 1.68s/it]
|
1053 |
98%|ββββββββββ| 11165/11346 [5:37:33<05:04, 1.68s/it]
|
1054 |
98%|ββββββββββ| 11166/11346 [5:37:34<05:03, 1.68s/it]
|
1055 |
98%|ββββββββββ| 11167/11346 [5:37:36<05:01, 1.68s/it]
|
1056 |
98%|ββββββββββ| 11168/11346 [5:37:38<04:59, 1.68s/it]
|
1057 |
98%|ββββββββββ| 11169/11346 [5:37:39<04:58, 1.69s/it]
|
1058 |
98%|ββββββββββ| 11170/11346 [5:37:41<04:56, 1.69s/it]
|
1059 |
98%|ββββββββββ| 11171/11346 [5:37:43<04:54, 1.68s/it]
|
1060 |
98%|ββββββββββ| 11172/11346 [5:37:44<04:53, 1.68s/it]
|
1061 |
98%|ββββββββββ| 11173/11346 [5:37:46<04:51, 1.68s/it]
|
1062 |
98%|ββββββββββ| 11174/11346 [5:37:48<04:49, 1.68s/it]
|
1063 |
98%|ββββββββββ| 11175/11346 [5:37:50<04:47, 1.68s/it]
|
1064 |
99%|ββββββββββ| 11176/11346 [5:37:51<04:46, 1.68s/it]
|
1065 |
99%|ββββββββββ| 11177/11346 [5:37:53<04:44, 1.68s/it]
|
1066 |
99%|ββββββββββ| 11178/11346 [5:37:55<04:42, 1.68s/it]
|
1067 |
99%|ββββββββββ| 11179/11346 [5:37:56<04:41, 1.68s/it]
|
1068 |
99%|ββββββββββ| 11180/11346 [5:37:58<04:39, 1.68s/it]
|
1069 |
99%|ββββββββββ| 11181/11346 [5:38:00<04:37, 1.68s/it]
|
1070 |
99%|ββββββββββ| 11182/11346 [5:38:01<04:36, 1.68s/it]
|
1071 |
99%|ββββββββββ| 11183/11346 [5:38:03<04:34, 1.68s/it]
|
1072 |
99%|ββββββββββ| 11184/11346 [5:38:05<04:32, 1.68s/it]
|
1073 |
99%|ββββββββββ| 11185/11346 [5:38:06<04:31, 1.68s/it]
|
1074 |
99%|ββββββββββ| 11186/11346 [5:38:08<04:29, 1.68s/it]
|
1075 |
99%|ββββββββββ| 11187/11346 [5:38:10<04:27, 1.68s/it]
|
1076 |
99%|ββββββββββ| 11188/11346 [5:38:11<04:26, 1.68s/it]
|
1077 |
99%|ββββββββββ| 11189/11346 [5:38:13<04:24, 1.68s/it]
|
1078 |
99%|ββββββββββ| 11190/11346 [5:38:15<04:22, 1.68s/it]
|
1079 |
99%|ββββββββββ| 11191/11346 [5:38:16<04:21, 1.68s/it]
|
1080 |
99%|ββββββββββ| 11192/11346 [5:38:18<04:19, 1.68s/it]
|
1081 |
99%|ββββββββββ| 11193/11346 [5:38:20<04:17, 1.68s/it]
|
1082 |
99%|ββββββββββ| 11194/11346 [5:38:22<04:16, 1.68s/it]
|
1083 |
99%|ββββββββββ| 11195/11346 [5:38:23<04:14, 1.68s/it]
|
1084 |
99%|ββββββββββ| 11196/11346 [5:38:25<04:12, 1.68s/it]
|
1085 |
99%|ββββββββββ| 11197/11346 [5:38:27<04:10, 1.68s/it]
|
1086 |
99%|ββββββββββ| 11198/11346 [5:38:28<04:09, 1.68s/it]
|
1087 |
99%|ββββββββββ| 11199/11346 [5:38:30<04:07, 1.68s/it]
|
1088 |
99%|ββββββββββ| 11200/11346 [5:38:32<04:05, 1.68s/it]
|
1089 |
99%|ββββββββββ| 11201/11346 [5:38:33<04:03, 1.68s/it]
|
1090 |
99%|ββββββββββ| 11202/11346 [5:38:35<04:02, 1.68s/it]
|
1091 |
99%|ββββββββββ| 11203/11346 [5:38:37<04:00, 1.68s/it]
|
1092 |
99%|ββββββββββ| 11204/11346 [5:38:38<03:59, 1.68s/it]
|
1093 |
99%|ββββββββββ| 11205/11346 [5:38:40<03:57, 1.68s/it]
|
1094 |
99%|ββββββββββ| 11206/11346 [5:38:42<03:55, 1.68s/it]
|
1095 |
99%|ββββββββββ| 11207/11346 [5:38:43<03:54, 1.68s/it]
|
1096 |
99%|ββββββββββ| 11208/11346 [5:38:45<03:52, 1.68s/it]
|
1097 |
99%|ββββββββββ| 11209/11346 [5:38:47<03:50, 1.68s/it]
|
1098 |
99%|ββββββββββ| 11210/11346 [5:38:48<03:49, 1.68s/it]
|
1099 |
99%|ββββββββββ| 11211/11346 [5:38:50<03:47, 1.68s/it]
|
1100 |
99%|ββββββββββ| 11212/11346 [5:38:52<03:45, 1.68s/it]
|
1101 |
99%|ββββββββββ| 11213/11346 [5:38:54<03:43, 1.68s/it]
|
1102 |
99%|ββββββββββ| 11214/11346 [5:38:55<03:42, 1.69s/it]
|
1103 |
99%|ββββββββββ| 11215/11346 [5:38:57<03:40, 1.68s/it]
|
1104 |
99%|ββββββββββ| 11216/11346 [5:38:59<03:39, 1.69s/it]
|
1105 |
99%|ββββββββββ| 11217/11346 [5:39:00<03:37, 1.69s/it]
|
1106 |
99%|ββββββββββ| 11218/11346 [5:39:02<03:35, 1.68s/it]
|
1107 |
99%|ββββββββββ| 11219/11346 [5:39:04<03:33, 1.68s/it]
|
1108 |
99%|ββββββββββ| 11220/11346 [5:39:05<03:32, 1.69s/it]
|
1109 |
99%|ββββββββββ| 11221/11346 [5:39:07<03:30, 1.69s/it]
|
1110 |
99%|ββββββββββ| 11222/11346 [5:39:09<03:28, 1.69s/it]
|
1111 |
99%|ββββββββββ| 11223/11346 [5:39:10<03:27, 1.69s/it]
|
1112 |
99%|ββββββββββ| 11224/11346 [5:39:12<03:25, 1.68s/it]
|
1113 |
99%|ββββββββββ| 11225/11346 [5:39:14<03:23, 1.69s/it]
|
1114 |
99%|ββββββββββ| 11226/11346 [5:39:15<03:22, 1.69s/it]
|
1115 |
99%|ββββββββββ| 11227/11346 [5:39:17<03:20, 1.68s/it]
|
1116 |
99%|ββββββββββ| 11228/11346 [5:39:19<03:18, 1.68s/it]
|
1117 |
99%|ββββββββββ| 11229/11346 [5:39:20<03:17, 1.69s/it]
|
1118 |
99%|ββββββββββ| 11230/11346 [5:39:22<03:15, 1.69s/it]
|
1119 |
99%|ββββββββββ| 11231/11346 [5:39:24<03:13, 1.69s/it]
|
1120 |
99%|ββββββββββ| 11232/11346 [5:39:26<03:12, 1.68s/it]
|
1121 |
99%|ββββββββββ| 11233/11346 [5:39:27<03:10, 1.68s/it]
|
1122 |
99%|ββββββββββ| 11234/11346 [5:39:29<03:08, 1.68s/it]
|
1123 |
99%|ββββββββββ| 11235/11346 [5:39:31<03:06, 1.68s/it]
|
1124 |
99%|ββββββββββ| 11236/11346 [5:39:32<03:05, 1.68s/it]
|
1125 |
99%|ββββββββββ| 11237/11346 [5:39:34<03:03, 1.68s/it]
|
1126 |
99%|ββββββββββ| 11238/11346 [5:39:36<03:01, 1.68s/it]
|
1127 |
99%|ββββββββββ| 11239/11346 [5:39:37<03:00, 1.68s/it]
|
1128 |
99%|ββββββββββ| 11240/11346 [5:39:39<02:58, 1.68s/it]
|
1129 |
99%|ββββββββββ| 11241/11346 [5:39:41<02:56, 1.68s/it]
|
1130 |
99%|ββββββββββ| 11242/11346 [5:39:42<02:55, 1.68s/it]
|
1131 |
99%|ββββββββββ| 11243/11346 [5:39:44<02:53, 1.68s/it]
|
1132 |
99%|ββββββββββ| 11244/11346 [5:39:46<02:51, 1.68s/it]
|
1133 |
99%|ββββββββββ| 11245/11346 [5:39:47<02:50, 1.68s/it]
|
1134 |
99%|ββββββββββ| 11246/11346 [5:39:49<02:48, 1.68s/it]
|
1135 |
99%|ββββββββββ| 11247/11346 [5:39:51<02:46, 1.69s/it]
|
1136 |
99%|ββββββββββ| 11248/11346 [5:39:52<02:45, 1.69s/it]
|
1137 |
99%|ββββββββββ| 11249/11346 [5:39:54<02:43, 1.69s/it]
|
1138 |
99%|ββββββββββ| 11250/11346 [5:39:56<02:41, 1.69s/it]
|
1139 |
99%|ββββββββββ| 11251/11346 [5:39:58<02:40, 1.69s/it]
|
1140 |
99%|ββββββββββ| 11252/11346 [5:39:59<02:38, 1.69s/it]
|
1141 |
99%|ββββββββββ| 11253/11346 [5:40:01<02:36, 1.69s/it]
|
1142 |
99%|ββββββββββ| 11254/11346 [5:40:03<02:34, 1.68s/it]
|
1143 |
99%|ββββββββββ| 11255/11346 [5:40:04<02:33, 1.68s/it]
|
1144 |
99%|ββββββββββ| 11256/11346 [5:40:06<02:31, 1.69s/it]
|
1145 |
99%|ββββββββββ| 11257/11346 [5:40:08<02:29, 1.68s/it]
|
1146 |
99%|ββββββββββ| 11258/11346 [5:40:09<02:28, 1.68s/it]
|
1147 |
99%|ββββββββββ| 11259/11346 [5:40:11<02:26, 1.68s/it]
|
1148 |
99%|ββββββββββ| 11260/11346 [5:40:13<02:24, 1.68s/it]
|
1149 |
99%|ββββββββββ| 11261/11346 [5:40:14<02:23, 1.68s/it]
|
1150 |
99%|ββββββββββ| 11262/11346 [5:40:16<02:21, 1.68s/it]
|
1151 |
99%|ββββββββββ| 11263/11346 [5:40:18<02:19, 1.68s/it]
|
1152 |
99%|ββββββββββ| 11264/11346 [5:40:19<02:17, 1.68s/it]
|
1153 |
99%|ββββββββββ| 11265/11346 [5:40:21<02:16, 1.68s/it]
|
1154 |
99%|ββββββββββ| 11266/11346 [5:40:23<02:14, 1.68s/it]
|
1155 |
99%|ββββββββββ| 11267/11346 [5:40:24<02:12, 1.68s/it]
|
1156 |
99%|ββββββββββ| 11268/11346 [5:40:26<02:11, 1.68s/it]
|
1157 |
99%|ββββββββββ| 11269/11346 [5:40:28<02:09, 1.68s/it]
|
1158 |
99%|ββββββββββ| 11270/11346 [5:40:30<02:07, 1.68s/it]
|
1159 |
99%|ββββββββββ| 11271/11346 [5:40:31<02:06, 1.68s/it]
|
1160 |
99%|ββββββββββ| 11272/11346 [5:40:33<02:04, 1.68s/it]
|
1161 |
99%|ββββββββββ| 11273/11346 [5:40:35<02:02, 1.68s/it]
|
1162 |
99%|ββββββββββ| 11274/11346 [5:40:36<02:01, 1.68s/it]
|
1163 |
99%|ββββββββββ| 11275/11346 [5:40:38<01:59, 1.68s/it]
|
1164 |
99%|ββββββββββ| 11276/11346 [5:40:40<01:57, 1.68s/it]
|
1165 |
99%|ββββββββββ| 11277/11346 [5:40:41<01:56, 1.68s/it]
|
1166 |
99%|ββββββββββ| 11278/11346 [5:40:43<01:54, 1.68s/it]
|
1167 |
99%|ββββββββββ| 11279/11346 [5:40:45<01:52, 1.68s/it]
|
1168 |
99%|ββββββββββ| 11280/11346 [5:40:46<01:51, 1.69s/it]
|
1169 |
99%|ββββββββββ| 11281/11346 [5:40:48<01:49, 1.69s/it]
|
1170 |
99%|ββββββββββ| 11282/11346 [5:40:50<01:47, 1.68s/it]
|
1171 |
99%|ββββββββββ| 11283/11346 [5:40:51<01:46, 1.68s/it]
|
1172 |
99%|ββββββββββ| 11284/11346 [5:40:53<01:44, 1.69s/it]
|
1173 |
99%|ββββββββββ| 11285/11346 [5:40:55<01:42, 1.68s/it]
|
1174 |
99%|ββββββββββ| 11286/11346 [5:40:56<01:41, 1.68s/it]
|
1175 |
99%|ββββββββββ| 11287/11346 [5:40:58<01:39, 1.68s/it]
|
1176 |
99%|ββββββββββ| 11288/11346 [5:41:00<01:37, 1.68s/it]
|
1177 |
99%|ββββββββββ| 11289/11346 [5:41:02<01:36, 1.69s/it]
|
1178 |
+
|
1179 |
+
Training completed. Do not forget to share your model on huggingface.co/models =)
|
1180 |
+
|
1181 |
+
|
1182 |
+
|
1183 |
|
1184 |
+
[INFO|trainer.py:3353] 2024-06-04 06:44:27,410 >> Saving model checkpoint to ./training_outputs_job_116987_1_04-06_01-01
|
1185 |
+
[INFO|configuration_utils.py:471] 2024-06-04 06:44:27,424 >> Configuration saved in ./training_outputs_job_116987_1_04-06_01-01/config.json
|
1186 |
+
[INFO|configuration_utils.py:705] 2024-06-04 06:44:27,429 >> Configuration saved in ./training_outputs_job_116987_1_04-06_01-01/generation_config.json
|
1187 |
+
[INFO|modeling_utils.py:2592] 2024-06-04 06:44:28,362 >> Model weights saved in ./training_outputs_job_116987_1_04-06_01-01/model.safetensors
|
1188 |
+
[INFO|tokenization_utils_base.py:2503] 2024-06-04 06:44:28,373 >> tokenizer config file saved in ./training_outputs_job_116987_1_04-06_01-01/tokenizer_config.json
|
1189 |
+
[INFO|tokenization_utils_base.py:2512] 2024-06-04 06:44:28,377 >> Special tokens file saved in ./training_outputs_job_116987_1_04-06_01-01/special_tokens_map.json
|
1190 |
+
[INFO|trainer.py:3353] 2024-06-04 06:44:28,428 >> Saving model checkpoint to ./training_outputs_job_116987_1_04-06_01-01
|
1191 |
+
[INFO|configuration_utils.py:471] 2024-06-04 06:44:28,432 >> Configuration saved in ./training_outputs_job_116987_1_04-06_01-01/config.json
|
1192 |
+
[INFO|configuration_utils.py:705] 2024-06-04 06:44:28,436 >> Configuration saved in ./training_outputs_job_116987_1_04-06_01-01/generation_config.json
|
1193 |
+
[INFO|modeling_utils.py:2592] 2024-06-04 06:44:29,368 >> Model weights saved in ./training_outputs_job_116987_1_04-06_01-01/model.safetensors
|
1194 |
+
[INFO|tokenization_utils_base.py:2503] 2024-06-04 06:44:29,380 >> tokenizer config file saved in ./training_outputs_job_116987_1_04-06_01-01/tokenizer_config.json
|
1195 |
+
[INFO|tokenization_utils_base.py:2512] 2024-06-04 06:44:29,384 >> Special tokens file saved in ./training_outputs_job_116987_1_04-06_01-01/special_tokens_map.json
|
1196 |
+
[INFO|modelcard.py:450] 2024-06-04 06:44:29,735 >> Dropping the following result as it does not have all the necessary fields:
|
1197 |
+
{'task': {'name': 'Causal Language Modeling', 'type': 'text-generation'}, 'metrics': [{'name': 'Accuracy', 'type': 'accuracy', 'value': 0.5811017183152439}]}
|
1198 |
+
{'eval_loss': 2.3603451251983643, 'eval_accuracy': 0.5811017183152439, 'eval_runtime': 129.0604, 'eval_samples_per_second': 14.257, 'eval_steps_per_second': 0.302, 'epoch': 2.91}
|
1199 |
+
{'train_runtime': 20556.3593, 'train_samples_per_second': 13.243, 'train_steps_per_second': 0.552, 'train_loss': 2.5941595100713495, 'epoch': 3.0}
|
1200 |
+
|
1201 |
+
|
1202 |
+
|
1203 |
+
|
1204 |
+
|
1205 |
+
|
1206 |
+
|
1207 |
+
|
1208 |
+
|
1209 |
+
|
1210 |
+
|
1211 |
+
|
1212 |
+
|
1213 |
+
|
1214 |
+
|
1215 |
+
|
1216 |
+
|
1217 |
+
|
1218 |
+
|
1219 |
+
|
1220 |
+
|
1221 |
+
|
1222 |
+
|
1223 |
+
|
1224 |
+
|
1225 |
+
|
1226 |
+
|
1227 |
+
|
1228 |
+
|
1229 |
+
|
1230 |
+
|
1231 |
+
|
1232 |
+
|
1233 |
+
|
1234 |
+
|
1235 |
+
|
1236 |
+
|
1237 |
+
|
1238 |
+
|
1239 |
+
|
1240 |
+
|
1241 |
+
|
1242 |
+
|
1243 |
+
|
1244 |
+
|
1245 |
+
|
1246 |
+
|
1247 |
+
|
1248 |
+
|
1249 |
+
|
1250 |
+
|
1251 |
+
|
1252 |
+
|
1253 |
+
|
1254 |
+
|
1255 |
+
|
1256 |
+
|
1257 |
+
|
1258 |
+
|
1259 |
+
|
1260 |
+
|
1261 |
+
|
1262 |
+
|
1263 |
+
|
1264 |
+
|
1265 |
+
|
1266 |
+
|
1267 |
+
|
1268 |
+
|
1269 |
+
|
1270 |
+
|
1271 |
+
|
1272 |
+
|
1273 |
+
|
1274 |
+
|
1275 |
+
|
1276 |
+
|
1277 |
+
|
1278 |
+
|
1279 |
+
|
1280 |
+
|
1281 |
+
|
1282 |
+
|
1283 |
+
|
1284 |
+
|
1285 |
+
|
1286 |
+
|
1287 |
+
|
1288 |
+
|
1289 |
+
|
1290 |
+
|
1291 |
+
|
1292 |
+
|
1293 |
+
|
1294 |
+
|
1295 |
+
|
1296 |
+
|
1297 |
+
|
1298 |
+
|
1299 |
+
|
1300 |
+
|
1301 |
+
|
1302 |
+
|
1303 |
+
|
1304 |
+
|
1305 |
+
|
1306 |
+
|
1307 |
+
|
1308 |
+
|
1309 |
+
|
1310 |
+
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 272123144
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4da20f027b42fc4101ece18c5fdcbc8330d01d2227400bc85f4697eff0c2ee26
|
3 |
size 272123144
|