yassineafr committed on May 22

Commit 38a97c5 • 1 Parent(s): 7dfb9bc

jaisBdarija

Files changed (27)

README.md +55 -0
adapter_config.json +28 -0
adapter_model.safetensors +3 -0
runs/May22_11-21-59_2c1b614ec68f/events.out.tfevents.1716376951.2c1b614ec68f.34.0 +3 -0
runs/May22_11-21-59_2c1b614ec68f/events.out.tfevents.1716377045.2c1b614ec68f.34.1 +3 -0
runs/May22_11-33-56_2c1b614ec68f/events.out.tfevents.1716377651.2c1b614ec68f.217.0 +3 -0
training_args.bin +3 -0
wandb/debug-internal.log +0 -0
wandb/debug.log +54 -0
wandb/run-20240522_112259-4b714brj/files/conda-environment.yaml +0 -0
wandb/run-20240522_112259-4b714brj/files/config.yaml +737 -0
wandb/run-20240522_112259-4b714brj/files/output.log +3 -0
wandb/run-20240522_112259-4b714brj/files/requirements.txt +878 -0
wandb/run-20240522_112259-4b714brj/files/wandb-metadata.json +62 -0
wandb/run-20240522_112259-4b714brj/files/wandb-summary.json +1 -0
wandb/run-20240522_112259-4b714brj/logs/debug-internal.log +308 -0
wandb/run-20240522_112259-4b714brj/logs/debug.log +48 -0
wandb/run-20240522_112259-4b714brj/run-4b714brj.wandb +0 -0
wandb/run-20240522_113413-8mudzhjp/files/conda-environment.yaml +0 -0
wandb/run-20240522_113413-8mudzhjp/files/config.yaml +754 -0
wandb/run-20240522_113413-8mudzhjp/files/output.log +93 -0
wandb/run-20240522_113413-8mudzhjp/files/requirements.txt +878 -0
wandb/run-20240522_113413-8mudzhjp/files/wandb-metadata.json +62 -0
wandb/run-20240522_113413-8mudzhjp/files/wandb-summary.json +1 -0
wandb/run-20240522_113413-8mudzhjp/logs/debug-internal.log +0 -0
wandb/run-20240522_113413-8mudzhjp/logs/debug.log +54 -0
wandb/run-20240522_113413-8mudzhjp/run-8mudzhjp.wandb +0 -0

README.md ADDED

	@@ -0,0 +1,55 @@

+---
+license: apache-2.0
+library_name: peft
+tags:
+- generated_from_trainer
+base_model: core42/jais-13b
+model-index:
+- name: working
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/yassine-af/huggingface/runs/8mudzhjp)
+# working
+This model is a fine-tuned version of [core42/jais-13b](https://huggingface.co/core42/jais-13b) on an unknown dataset.
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0002
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 2
+### Training results
+### Framework versions
+- PEFT 0.11.1
+- Transformers 4.41.0
+- Pytorch 2.3.0+cu121
+- Datasets 2.18.0
+- Tokenizers 0.19.1
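The card lists `lr_scheduler_type: linear` with `learning_rate: 0.0002`, and the training arguments logged in this commit show `warmup_steps: 0`. A minimal sketch of what such a schedule implies, assuming the usual linear decay to zero over the whole run. The total number of optimizer steps is not recorded here, so `total_steps` is a hypothetical value supplied by the caller:

```python
def linear_lr(step: int, total_steps: int,
              base_lr: float = 2e-4, warmup_steps: int = 0) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay to zero.

    base_lr and warmup_steps default to the values reported in the model card
    and logged config; total_steps is an assumption made by the caller.
    """
    if warmup_steps > 0 and step < warmup_steps:
        # Warmup phase (unused here, since warmup_steps is 0).
        return base_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Example with a hypothetical run length of 1000 steps.
for step in (0, 250, 500, 1000):
    print(step, linear_lr(step, total_steps=1000))
```

With no warmup, the rate starts at the full 0.0002 on the first step and falls in a straight line, reaching half its value at the midpoint of training.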

adapter_config.json ADDED

	@@ -0,0 +1,28 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "core42/jais-13b",
+  "bias": "none",
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_dropout": 0.05,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "c_attn"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}
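The adapter config sets `r: 16`, `lora_alpha: 32` and `use_rslora: false`. In standard LoRA the low-rank update is multiplied by `lora_alpha / r`, while the rank-stabilized variant divides by the square root of `r` instead. A small sketch of that calculation; the JSON fragment below copies only the relevant keys from the file above, and `lora_scaling` is a hypothetical helper name, not a PEFT function:

```python
import json
import math

# Subset of adapter_config.json, reduced to the keys used below.
config = json.loads("""
{"r": 16, "lora_alpha": 32, "lora_dropout": 0.05,
 "target_modules": ["c_attn"], "use_rslora": false}
""")


def lora_scaling(cfg: dict) -> float:
    """Scaling applied to the low-rank update B @ A."""
    if cfg.get("use_rslora"):
        # Rank-stabilized LoRA: alpha / sqrt(r).
        return cfg["lora_alpha"] / math.sqrt(cfg["r"])
    # Standard LoRA: alpha / r.
    return cfg["lora_alpha"] / cfg["r"]


scaling = lora_scaling(config)
print(scaling)
```

For this config the factor works out to 2, so the adapter's contribution is doubled before being added to the frozen `c_attn` output.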

adapter_model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:06eb94a68730c3df8c86971da90ff7ccdb9675cacf17d64680c2d56ad19ca84b
+size 52439304
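The pointer reports an adapter file of 52,439,304 bytes. As a rough sanity check, the sketch below estimates the LoRA weight count from values logged elsewhere in this commit (`n_embd: 5120`, `n_layer: 40`, `r: 16`, one target module `c_attn` per layer). Two things are assumptions rather than facts from the source: that `c_attn` is a fused query/key/value projection with output width `3 * n_embd`, and that the adapter tensors are stored as float32.

```python
n_embd = 5120          # from the logged model config
n_layer = 40           # from the logged model config
r = 16                 # LoRA rank from adapter_config.json
file_size = 52_439_304 # from the git-lfs pointer above

# ASSUMPTION: c_attn maps n_embd -> 3 * n_embd (fused Q, K, V projection).
fused_out = 3 * n_embd

# Each adapted layer adds A (r x n_embd) and B (fused_out x r).
params_per_layer = r * n_embd + fused_out * r
total_params = n_layer * params_per_layer

# ASSUMPTION: 4 bytes per weight (float32 storage).
tensor_bytes = total_params * 4
overhead = file_size - tensor_bytes

print(total_params, tensor_bytes, overhead)
```

Under these assumptions the tensors account for all but about 10 KB of the file, which is a plausible size for a safetensors header. The close match is consistent with the assumptions but does not prove them.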

runs/May22_11-21-59_2c1b614ec68f/events.out.tfevents.1716376951.2c1b614ec68f.34.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:77aac7af4f1322783dc01d0658d7058e73d34e75853655c8a7e4058af5ec5883
+size 4184

runs/May22_11-21-59_2c1b614ec68f/events.out.tfevents.1716377045.2c1b614ec68f.34.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c7ce9b9bce6054bed83d2e68701535d98230f470f01c89f9556a0294ba6b9182
+size 4184

runs/May22_11-33-56_2c1b614ec68f/events.out.tfevents.1716377651.2c1b614ec68f.217.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ba5e7576561ebc4f2e00618a7caa6fcca241fcf314ff5cb2df704c7d5bf07b77
+size 71472

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:84f44aec00f66301ea7ba9dc7fc89b45e0f501cd02ad5a27ad189e368306aa4a
+size 5112

wandb/debug-internal.log ADDED

The diff for this file is too large to render.

wandb/debug.log ADDED

	@@ -0,0 +1,54 @@

+2024-05-22 11:34:13,996 INFO    MainThread:217 [wandb_setup.py:_flush():76] Current SDK version is 0.16.6
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Configure stats pid to 217
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Applying login settings: {}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_init.py:_log_setup():521] Logging user logs to /kaggle/working/wandb/run-20240522_113413-8mudzhjp/logs/debug.log
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_init.py:_log_setup():522] Logging internal logs to /kaggle/working/wandb/run-20240522_113413-8mudzhjp/logs/debug-internal.log
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_init.py:_jupyter_setup():467] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7ef92390cee0>
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():561] calling init triggers
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():568] wandb.init called with sweep_config: {}
+config: {}
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():611] starting backend
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():615] setting up manager
+2024-05-22 11:34:14,000 INFO    MainThread:217 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2024-05-22 11:34:14,002 INFO    MainThread:217 [wandb_init.py:init():623] backend started and connected
+2024-05-22 11:34:14,016 INFO    MainThread:217 [wandb_run.py:_label_probe_notebook():1299] probe notebook
+2024-05-22 11:34:14,540 INFO    MainThread:217 [wandb_init.py:init():715] updated telemetry
+2024-05-22 11:34:14,544 INFO    MainThread:217 [wandb_init.py:init():748] communicating run to backend with 90.0 second timeout
+2024-05-22 11:34:14,778 INFO    MainThread:217 [wandb_run.py:_on_init():2357] communicating current version
+2024-05-22 11:34:14,843 INFO    MainThread:217 [wandb_run.py:_on_init():2366] got version response upgrade_message: "wandb version 0.17.0 is available!  To upgrade, please run:\n $ pip install wandb --upgrade"
+2024-05-22 11:34:14,844 INFO    MainThread:217 [wandb_init.py:init():799] starting run threads in backend
+2024-05-22 11:34:30,856 INFO    MainThread:217 [wandb_run.py:_console_start():2335] atexit reg
+2024-05-22 11:34:30,857 INFO    MainThread:217 [wandb_run.py:_redirect():2190] redirect: wrap_raw
+2024-05-22 11:34:30,857 INFO    MainThread:217 [wandb_run.py:_redirect():2255] Wrapping output streams.
+2024-05-22 11:34:30,857 INFO    MainThread:217 [wandb_run.py:_redirect():2280] Redirects installed.
+2024-05-22 11:34:30,858 INFO    MainThread:217 [wandb_init.py:init():842] run started, returning control to user process
+2024-05-22 11:34:30,865 INFO    MainThread:217 [wandb_run.py:_config_callback():1347] config_cb None None {'peft_config': {'default': {'peft_type': <PeftType.LORA: 'LORA'>, 'auto_mapping': None, 'base_model_name_or_path': 'core42/jais-13b', 'revision': None, 'task_type': 'CAUSAL_LM', 'inference_mode': False, 'r': 16, 'target_modules': {'c_attn'}, 'lora_alpha': 32, 'lora_dropout': 0.05, 'fan_in_fan_out': False, 'bias': 'none', 'use_rslora': False, 'modules_to_save': None, 'init_lora_weights': True, 'layers_to_transform': None, 'layers_pattern': None, 'rank_pattern': {}, 'alpha_pattern': {}, 'megatron_config': None, 'megatron_core': 'megatron.core', 'loftq_config': {}, 'use_dora': False, 'layer_replication': None}}, 'vocab_size': 84992, 'n_positions': 2048, 'n_embd': 5120, 'n_layer': 40, 'n_head': 40, 'n_inner': 13653, 'activation_function': 'swiglu', 'resid_pdrop': 0.0, 'embd_pdrop': 0.0, 'attn_pdrop': 0.0, 'layer_norm_epsilon': 1e-05, 'initializer_range': 0.02, 'scale_attn_weights': True, 'use_cache': False, 'scale_attn_by_inverse_layer_idx': False, 'reorder_and_upcast_attn': False, 'bos_token_id': 0, 'eos_token_id': 0, 'position_embedding_type': 'alibi', 'width_scale': 0.11100000000000002, 'embeddings_scale': 14.6, 'scale_qk_dot_by_d': True, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 
'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['JAISLMHeadModel'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'pad_token_id': 0, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'core42/jais-13b', 'transformers_version': '4.41.0', 'auto_map': {'AutoConfig': 'core42/jais-13b--configuration_jais.JAISConfig', 'AutoModel': 'core42/jais-13b--modeling_jais.JAISModel', 'AutoModelForCausalLM': 'core42/jais-13b--modeling_jais.JAISLMHeadModel', 'AutoModelForQuestionAnswering': 'core42/jais-13b--modeling_jais.JAISForQuestionAnswering', 'AutoModelForSequenceClassification': 'core42/jais-13b--modeling_jais.JAISForSequenceClassification', 'AutoModelForTokenClassification': 'core42/jais-13b--modeling_jais.JAISForTokenClassification'}, 'model_type': 'jais', 'quantization_config': {'quant_method': 'QuantizationMethod.BITS_AND_BYTES', '_load_in_8bit': False, '_load_in_4bit': True, 'llm_int8_threshold': 6.0, 'llm_int8_skip_modules': None, 'llm_int8_enable_fp32_cpu_offload': False, 'llm_int8_has_fp16_weight': False, 'bnb_4bit_quant_type': 'nf4', 'bnb_4bit_use_double_quant': False, 'bnb_4bit_compute_dtype': 'bfloat16', 'bnb_4bit_quant_storage': 'uint8', 'load_in_4bit': True, 'load_in_8bit': False}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': False, 'do_predict': False, 'eval_strategy': 'no', 'prediction_loss_only': False, 'per_device_train_batch_size': 8, 'per_device_eval_batch_size': 8, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 
'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.0, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 2, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 0, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/May22_11-33-56_2c1b614ec68f', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': 4, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': True, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': False, 'metric_for_best_model': None, 'greater_is_better': None, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 
'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': True, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False}
+2024-05-22 11:34:30,875 INFO    MainThread:217 [wandb_config.py:__setitem__():151] config set model/num_parameters = 13033919160 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x7ef9227a9060>>
+2024-05-22 11:34:30,876 INFO    MainThread:217 [wandb_run.py:_config_callback():1347] config_cb model/num_parameters 13033919160 None
+2024-05-22 14:04:41,874 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:04:41,875 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:14:52,958 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:14:54,437 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:14:54,437 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:15:26,186 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:16:25,347 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:16:25,348 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:16:29,691 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:16:44,749 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:16:44,749 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:23:14,136 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:23:16,353 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:23:16,353 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:26:18,732 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:26:19,623 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:26:19,624 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:34:00,493 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:34:00,984 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:34:00,984 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:34:16,410 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
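The log shows the run handing control to the notebook at 11:34:30 and the first `pausing backend` entry at 14:04:41. A short sketch of how the gap between two such entries can be measured from the timestamps. Reading that gap as the duration of the training cell is an interpretation, not something the log states, and `parse_ts` is a hypothetical helper:

```python
from datetime import datetime

# Two lines copied from wandb/debug.log above (leading "+" removed).
started = "2024-05-22 11:34:30,858 INFO    MainThread:217 [wandb_init.py:init():842] run started, returning control to user process"
paused = "2024-05-22 14:04:41,875 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend"


def parse_ts(line: str) -> datetime:
    """Parse the leading 'YYYY-MM-DD HH:MM:SS,mmm' timestamp of a log line."""
    # The timestamp is always the first 23 characters of the line.
    return datetime.strptime(line[:23], "%Y-%m-%d %H:%M:%S,%f")


elapsed = parse_ts(paused) - parse_ts(started)
print(elapsed)
```

The same helper can be applied to any pair of `pausing backend` and `resuming backend` lines to see how long the notebook sat idle between cells.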

wandb/run-20240522_112259-4b714brj/files/conda-environment.yaml ADDED

File without changes

wandb/run-20240522_112259-4b714brj/files/config.yaml ADDED

	@@ -0,0 +1,737 @@

+wandb_version: 1
+_wandb:
+  desc: null
+  value:
+    python_version: 3.10.13
+    cli_version: 0.16.6
+    framework: huggingface
+    huggingface_version: 4.41.0
+    is_jupyter_run: true
+    is_kaggle_kernel: true
+    start_time: 1716376979.0
+    t:
+      1:
+      - 1
+      - 2
+      - 3
+      - 5
+      - 11
+      - 12
+      - 49
+      - 51
+      - 53
+      - 55
+      - 71
+      - 98
+      - 105
+      2:
+      - 1
+      - 2
+      - 3
+      - 5
+      - 11
+      - 12
+      - 49
+      - 51
+      - 53
+      - 55
+      - 71
+      - 98
+      - 105
+      3:
+      - 7
+      - 13
+      - 19
+      - 23
+      4: 3.10.13
+      5: 0.16.6
+      6: 4.41.0
+      8:
+      - 1
+      - 2
+      - 5
+      9:
+        1: transformers_trainer
+      13: linux-x86_64
+    m:
+    - 1: train/global_step
+      6:
+      - 3
+peft_config:
+  desc: null
+  value:
+    default:
+      peft_type: LORA
+      auto_mapping: null
+      base_model_name_or_path: core42/jais-13b
+      revision: null
+      task_type: CAUSAL_LM
+      inference_mode: false
+      r: 16
+      target_modules:
+      - c_attn
+      lora_alpha: 32
+      lora_dropout: 0.05
+      fan_in_fan_out: false
+      bias: none
+      use_rslora: false
+      modules_to_save: null
+      init_lora_weights: true
+      layers_to_transform: null
+      layers_pattern: null
+      rank_pattern: {}
+      alpha_pattern: {}
+      megatron_config: null
+      megatron_core: megatron.core
+      loftq_config: {}
+      use_dora: false
+      layer_replication: null
+vocab_size:
+  desc: null
+  value: 84992
+n_positions:
+  desc: null
+  value: 2048
+n_embd:
+  desc: null
+  value: 5120
+n_layer:
+  desc: null
+  value: 40
+n_head:
+  desc: null
+  value: 40
+n_inner:
+  desc: null
+  value: 13653
+activation_function:
+  desc: null
+  value: swiglu
+resid_pdrop:
+  desc: null
+  value: 0.0
+embd_pdrop:
+  desc: null
+  value: 0.0
+attn_pdrop:
+  desc: null
+  value: 0.0
+layer_norm_epsilon:
+  desc: null
+  value: 1.0e-05
+initializer_range:
+  desc: null
+  value: 0.02
+scale_attn_weights:
+  desc: null
+  value: true
+use_cache:
+  desc: null
+  value: false
+scale_attn_by_inverse_layer_idx:
+  desc: null
+  value: false
+reorder_and_upcast_attn:
+  desc: null
+  value: false
+bos_token_id:
+  desc: null
+  value: 0
+eos_token_id:
+  desc: null
+  value: 0
+position_embedding_type:
+  desc: null
+  value: alibi
+width_scale:
+  desc: null
+  value: 0.11100000000000002
+embeddings_scale:
+  desc: null
+  value: 14.6
+scale_qk_dot_by_d:
+  desc: null
+  value: true
+return_dict:
+  desc: null
+  value: true
+output_hidden_states:
+  desc: null
+  value: false
+output_attentions:
+  desc: null
+  value: false
+torchscript:
+  desc: null
+  value: false
+torch_dtype:
+  desc: null
+  value: float32
+use_bfloat16:
+  desc: null
+  value: false
+tf_legacy_loss:
+  desc: null
+  value: false
+pruned_heads:
+  desc: null
+  value: {}
+tie_word_embeddings:
+  desc: null
+  value: true
+chunk_size_feed_forward:
+  desc: null
+  value: 0
+is_encoder_decoder:
+  desc: null
+  value: false
+is_decoder:
+  desc: null
+  value: false
+cross_attention_hidden_size:
+  desc: null
+  value: null
+add_cross_attention:
+  desc: null
+  value: false
+tie_encoder_decoder:
+  desc: null
+  value: false
+max_length:
+  desc: null
+  value: 20
+min_length:
+  desc: null
+  value: 0
+do_sample:
+  desc: null
+  value: false
+early_stopping:
+  desc: null
+  value: false
+num_beams:
+  desc: null
+  value: 1
+num_beam_groups:
+  desc: null
+  value: 1
+diversity_penalty:
+  desc: null
+  value: 0.0
+temperature:
+  desc: null
+  value: 1.0
+top_k:
+  desc: null
+  value: 50
+top_p:
+  desc: null
+  value: 1.0
+typical_p:
+  desc: null
+  value: 1.0
+repetition_penalty:
+  desc: null
+  value: 1.0
+length_penalty:
+  desc: null
+  value: 1.0
+no_repeat_ngram_size:
+  desc: null
+  value: 0
+encoder_no_repeat_ngram_size:
+  desc: null
+  value: 0
+bad_words_ids:
+  desc: null
+  value: null
+num_return_sequences:
+  desc: null
+  value: 1
+output_scores:
+  desc: null
+  value: false
+return_dict_in_generate:
+  desc: null
+  value: false
+forced_bos_token_id:
+  desc: null
+  value: null
+forced_eos_token_id:
+  desc: null
+  value: null
+remove_invalid_values:
+  desc: null
+  value: false
+exponential_decay_length_penalty:
+  desc: null
+  value: null
+suppress_tokens:
+  desc: null
+  value: null
+begin_suppress_tokens:
+  desc: null
+  value: null
+architectures:
+  desc: null
+  value:
+  - JAISLMHeadModel
+finetuning_task:
+  desc: null
+  value: null
+id2label:
+  desc: null
+  value:
+    '0': LABEL_0
+    '1': LABEL_1
+label2id:
+  desc: null
+  value:
+    LABEL_0: 0
+    LABEL_1: 1
+tokenizer_class:
+  desc: null
+  value: null
+prefix:
+  desc: null
+  value: null
+pad_token_id:
+  desc: null
+  value: 0
+sep_token_id:
+  desc: null
+  value: null
+decoder_start_token_id:
+  desc: null
+  value: null
+task_specific_params:
+  desc: null
+  value: null
+problem_type:
+  desc: null
+  value: null
+_name_or_path:
+  desc: null
+  value: core42/jais-13b
+transformers_version:
+  desc: null
+  value: 4.41.0
+auto_map:
+  desc: null
+  value:
+    AutoConfig: core42/jais-13b--configuration_jais.JAISConfig
+    AutoModel: core42/jais-13b--modeling_jais.JAISModel
+    AutoModelForCausalLM: core42/jais-13b--modeling_jais.JAISLMHeadModel
+    AutoModelForQuestionAnswering: core42/jais-13b--modeling_jais.JAISForQuestionAnswering
+    AutoModelForSequenceClassification: core42/jais-13b--modeling_jais.JAISForSequenceClassification
+    AutoModelForTokenClassification: core42/jais-13b--modeling_jais.JAISForTokenClassification
+model_type:
+  desc: null
+  value: jais
+quantization_config:
+  desc: null
+  value:
+    quant_method: QuantizationMethod.BITS_AND_BYTES
+    _load_in_8bit: false
+    _load_in_4bit: true
+    llm_int8_threshold: 6.0
+    llm_int8_skip_modules: null
+    llm_int8_enable_fp32_cpu_offload: false
+    llm_int8_has_fp16_weight: false
+    bnb_4bit_quant_type: nf4
+    bnb_4bit_use_double_quant: false
+    bnb_4bit_compute_dtype: bfloat16
+    bnb_4bit_quant_storage: uint8
+    load_in_4bit: true
+    load_in_8bit: false
+output_dir:
+  desc: null
+  value: /kaggle/working/
+overwrite_output_dir:
+  desc: null
+  value: false
+do_train:
+  desc: null
+  value: false
+do_eval:
+  desc: null
+  value: false
+do_predict:
+  desc: null
+  value: false
+eval_strategy:
+  desc: null
+  value: 'no'
+prediction_loss_only:
+  desc: null
+  value: false
+per_device_train_batch_size:
+  desc: null
+  value: 8
+per_device_eval_batch_size:
+  desc: null
+  value: 8
+per_gpu_train_batch_size:
+  desc: null
+  value: null
+per_gpu_eval_batch_size:
+  desc: null
+  value: null
+gradient_accumulation_steps:
+  desc: null
+  value: 1
+eval_accumulation_steps:
+  desc: null
+  value: null
+eval_delay:
+  desc: null
+  value: 0
+learning_rate:
+  desc: null
+  value: 0.0002
+weight_decay:
+  desc: null
+  value: 0.0
+adam_beta1:
+  desc: null
+  value: 0.9
+adam_beta2:
+  desc: null
+  value: 0.999
+adam_epsilon:
+  desc: null
+  value: 1.0e-08
+max_grad_norm:
+  desc: null
+  value: 1.0
+num_train_epochs:
+  desc: null
+  value: 2
+max_steps:
+  desc: null
+  value: -1
+lr_scheduler_type:
+  desc: null
+  value: linear
+lr_scheduler_kwargs:
+  desc: null
+  value: {}
+warmup_ratio:
+  desc: null
+  value: 0.0
+warmup_steps:
+  desc: null
+  value: 0
+log_level:
+  desc: null
+  value: passive
+log_level_replica:
+  desc: null
+  value: warning
+log_on_each_node:
+  desc: null
+  value: true
+logging_dir:
+  desc: null
+  value: /kaggle/working/runs/May22_11-21-59_2c1b614ec68f
+logging_strategy:
+  desc: null
+  value: steps
+logging_first_step:
+  desc: null
+  value: false
+logging_steps:
+  desc: null
+  value: 10
+logging_nan_inf_filter:
+  desc: null
+  value: true
+save_strategy:
+  desc: null
+  value: epoch
+save_steps:
+  desc: null
+  value: 500
+save_total_limit:
+  desc: null
+  value: 4
+save_safetensors:
+  desc: null
+  value: true
+save_on_each_node:
+  desc: null
+  value: false
+save_only_model:
+  desc: null
+  value: false
+restore_callback_states_from_checkpoint:
+  desc: null
+  value: false
+no_cuda:
+  desc: null
+  value: false
+use_cpu:
+  desc: null
+  value: false
+use_mps_device:
+  desc: null
+  value: false
+seed:
+  desc: null
+  value: 42
+data_seed:
+  desc: null
+  value: null
+jit_mode_eval:
+  desc: null
+  value: false
+use_ipex:
+  desc: null
+  value: false
+bf16:
+  desc: null
+  value: true
+fp16:
+  desc: null
+  value: false
+fp16_opt_level:
+  desc: null
+  value: O1
+half_precision_backend:
+  desc: null
+  value: auto
+bf16_full_eval:
+  desc: null
+  value: false
+fp16_full_eval:
+  desc: null
+  value: false
+tf32:
+  desc: null
+  value: null
+local_rank:
+  desc: null
+  value: 0
+ddp_backend:
+  desc: null
+  value: null
+tpu_num_cores:
+  desc: null
+  value: null
+tpu_metrics_debug:
+  desc: null
+  value: false
+debug:
+  desc: null
+  value: []
+dataloader_drop_last:
+  desc: null
+  value: false
+eval_steps:
+  desc: null
+  value: null
+dataloader_num_workers:
+  desc: null
+  value: 0
+dataloader_prefetch_factor:
+  desc: null
+  value: null
+past_index:
+  desc: null
+  value: -1
+run_name:
+  desc: null
+  value: /kaggle/working/
+disable_tqdm:
+  desc: null
+  value: false
+remove_unused_columns:
+  desc: null
+  value: true
+label_names:
+  desc: null
+  value: null
+load_best_model_at_end:
+  desc: null
+  value: false
+metric_for_best_model:
+  desc: null
+  value: null
+greater_is_better:
+  desc: null
+  value: null
+ignore_data_skip:
+  desc: null
+  value: false
+fsdp:
+  desc: null
+  value: []
+fsdp_min_num_params:
+  desc: null
+  value: 0
+fsdp_config:
+  desc: null
+  value:
+    min_num_params: 0
+    xla: false
+    xla_fsdp_v2: false
+    xla_fsdp_grad_ckpt: false
+fsdp_transformer_layer_cls_to_wrap:
+  desc: null
+  value: null
+accelerator_config:
+  desc: null
+  value:
+    split_batches: false
+    dispatch_batches: null
+    even_batches: true
+    use_seedable_sampler: true
+    non_blocking: false
+    gradient_accumulation_kwargs: null
+deepspeed:
+  desc: null
+  value: null
+label_smoothing_factor:
+  desc: null
+  value: 0.0
+optim:
+  desc: null
+  value: adamw_torch
+optim_args:
+  desc: null
+  value: null
+adafactor:
+  desc: null
+  value: false
+group_by_length:
+  desc: null
+  value: false
+length_column_name:
+  desc: null
+  value: length
+report_to:
+  desc: null
+  value:
+  - tensorboard
+  - wandb
+ddp_find_unused_parameters:
+  desc: null
+  value: null
+ddp_bucket_cap_mb:
+  desc: null
+  value: null
+ddp_broadcast_buffers:
+  desc: null
+  value: null
+dataloader_pin_memory:
+  desc: null
+  value: true
+dataloader_persistent_workers:
+  desc: null
+  value: false
+skip_memory_metrics:
+  desc: null
+  value: true
+use_legacy_prediction_loop:
+  desc: null
+  value: false
+push_to_hub:
+  desc: null
+  value: false
+resume_from_checkpoint:
+  desc: null
+  value: null
+hub_model_id:
+  desc: null
+  value: null
+hub_strategy:
+  desc: null
+  value: every_save
+hub_token:
+  desc: null
+  value: <HUB_TOKEN>
+hub_private_repo:
+  desc: null
+  value: false
+hub_always_push:
+  desc: null
+  value: false
+gradient_checkpointing:
+  desc: null
+  value: false
+gradient_checkpointing_kwargs:
+  desc: null
+  value: null
+include_inputs_for_metrics:
+  desc: null
+  value: false
+eval_do_concat_batches:
+  desc: null
+  value: true
+fp16_backend:
+  desc: null
+  value: auto
+evaluation_strategy:
+  desc: null
+  value: null
+push_to_hub_model_id:
+  desc: null
+  value: null
+push_to_hub_organization:
+  desc: null
+  value: null
+push_to_hub_token:
+  desc: null
+  value: <PUSH_TO_HUB_TOKEN>
+mp_parameters:
+  desc: null
+  value: ''
+auto_find_batch_size:
+  desc: null
+  value: true
+full_determinism:
+  desc: null
+  value: false
+torchdynamo:
+  desc: null
+  value: null
+ray_scope:
+  desc: null
+  value: last
+ddp_timeout:
+  desc: null
+  value: 1800
+torch_compile:
+  desc: null
+  value: false
+torch_compile_backend:
+  desc: null
+  value: null
+torch_compile_mode:
+  desc: null
+  value: null
+dispatch_batches:
+  desc: null
+  value: null
+split_batches:
+  desc: null
+  value: null
+include_tokens_per_second:
+  desc: null
+  value: false
+include_num_input_tokens_seen:
+  desc: null
+  value: false
+neftune_noise_alpha:
+  desc: null
+  value: null
+optim_target_modules:
+  desc: null
+  value: null
+batch_eval_metrics:
+  desc: null
+  value: false
+model/num_parameters:
+  desc: null
+  value: 13033919160
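The config records `model/num_parameters: 13033919160` together with a bitsandbytes `nf4` 4-bit quantization config. A back-of-the-envelope sketch of what that means for weight storage, assuming every parameter is held at the stated bit width. This is a simplification: quantization constants add overhead, and some layers are typically kept in higher precision, so the real footprint will differ.

```python
num_parameters = 13_033_919_160  # model/num_parameters from config.yaml
GIB = 1024 ** 3


def weight_gib(n_params: int, bits_per_param: int) -> float:
    """Idealized weight storage in GiB at a uniform bit width."""
    return n_params * bits_per_param / 8 / GIB


fp32_gib = weight_gib(num_parameters, 32)  # torch_dtype in the config is float32
nf4_gib = weight_gib(num_parameters, 4)    # idealized 4-bit storage

print(f"float32: {fp32_gib:.1f} GiB, 4-bit: {nf4_gib:.1f} GiB")
```

The eightfold reduction is what makes it feasible to fine-tune a model of this size with LoRA on a single notebook accelerator, with the caveats noted above.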

wandb/run-20240522_112259-4b714brj/files/output.log ADDED

	@@ -0,0 +1,3 @@

+/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
+  warnings.warn(
+Overwriting /opt/conda/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py

wandb/run-20240522_112259-4b714brj/files/requirements.txt ADDED

	@@ -0,0 +1,878 @@

+Babel==2.14.0
+Boruta==0.3
+Brotli==1.0.9
+CVXcanon==0.1.2
+Cartopy==0.23.0
+Cython==3.0.8
+Deprecated==1.2.14
+Farama-Notifications==0.0.4
+Flask==3.0.3
+Geohash==1.0
+GitPython==3.1.41
+ImageHash==4.3.1
+Janome==0.5.0
+Jinja2==3.1.2
+LunarCalendar==0.0.9
+Mako==1.3.3
+Markdown==3.5.2
+MarkupSafe==2.1.3
+MarkupSafe==2.1.5
+Pillow==9.5.0
+PuLP==2.8.0
+PyArabic==0.6.15
+PyJWT==2.8.0
+PyMeeus==0.5.12
+PySocks==1.7.1
+PyUpSet==0.1.1.post7
+PyWavelets==1.5.0
+PyYAML==6.0.1
+Pygments==2.17.2
+Pympler==1.0.1
+QtPy==2.4.1
+Rtree==1.2.0
+SQLAlchemy==2.0.25
+SecretStorage==3.3.3
+Send2Trash==1.8.2
+Shapely==1.8.5.post1
+Shimmy==1.3.0
+SimpleITK==2.3.1
+TPOT==0.12.1
+Theano-PyMC==1.1.2
+Theano==1.0.5
+Wand==0.6.13
+Werkzeug==3.0.2
+absl-py==1.4.0
+accelerate==0.30.1
+access==1.1.9
+affine==2.4.0
+aiobotocore==2.12.3
+aiofiles==22.1.0
+aiohttp-cors==0.7.0
+aiohttp==3.9.1
+aioitertools==0.11.0
+aiorwlock==1.3.0
+aiosignal==1.3.1
+aiosqlite==0.19.0
+albumentations==1.4.0
+alembic==1.13.1
+altair==5.3.0
+annotated-types==0.6.0
+annoy==1.17.3
+anyio==4.2.0
+apache-beam==2.46.0
+aplus==0.11.0
+appdirs==1.4.4
+archspec==0.2.3
+argon2-cffi-bindings==21.2.0
+argon2-cffi==23.1.0
+array-record==0.5.0
+arrow==1.3.0
+arviz==0.18.0
+astroid==3.1.0
+astropy-iers-data==0.2024.4.15.2.45.49
+astropy==6.0.1
+asttokens==2.4.1
+astunparse==1.6.3
+async-lru==2.0.4
+async-timeout==4.0.3
+attrs==23.2.0
+audioread==3.0.1
+autopep8==2.0.4
+backoff==2.2.1
+bayesian-optimization==1.4.3
+beatrix_jupyterlab==2023.128.151533
+beautifulsoup4==4.12.2
+bitsandbytes==0.43.1
+blake3==0.2.1
+bleach==6.1.0
+blessed==1.20.0
+blinker==1.7.0
+blis==0.7.10
+blosc2==2.6.2
+bokeh==3.4.1
+boltons==23.1.1
+boto3==1.26.100
+botocore==1.34.69
+bq_helper==0.4.1
+bqplot==0.12.43
+branca==0.7.1
+brewer2mpl==1.4.1
+brotlipy==0.7.0
+cached-property==1.5.2
+cachetools==4.2.4
+cachetools==5.3.2
+catalogue==2.0.10
+catalyst==22.4
+catboost==1.2.3
+category-encoders==2.6.3
+certifi==2024.2.2
+cesium==0.12.1
+cffi==1.16.0
+charset-normalizer==3.3.2
+chex==0.1.86
+cleverhans==4.0.0
+click-plugins==1.1.1
+click==8.1.7
+cligj==0.7.2
+cloud-tpu-client==0.10
+cloud-tpu-profiler==2.4.0
+cloudpathlib==0.16.0
+cloudpickle==2.2.1
+cloudpickle==3.0.0
+cmdstanpy==1.2.2
+colorama==0.4.6
+colorcet==3.1.0
+colorful==0.5.6
+colorlog==6.8.2
+colorlover==0.3.0
+comm==0.2.1
+conda-libmamba-solver==23.7.0
+conda-package-handling==2.2.0
+conda==23.7.4
+conda_package_streaming==0.9.0
+confection==0.1.4
+contextily==1.6.0
+contourpy==1.2.0
+contourpy==1.2.1
+convertdate==2.4.0
+crcmod==1.7
+cryptography==41.0.7
+cuda-python==12.4.0
+cudf==23.8.0
+cufflinks==0.17.3
+cuml==23.8.0
+cupy==13.0.0
+cycler==0.12.1
+cymem==2.0.8
+cytoolz==0.12.3
+daal4py==2024.3.0
+daal==2024.3.0
+dacite==1.8.1
+dask-cuda==23.8.0
+dask-cudf==23.8.0
+dask-expr==1.0.11
+dask==2024.4.1
+dataclasses-json==0.6.4
+dataproc_jupyter_plugin==0.1.66
+datasets==2.18.0
+datashader==0.16.0
+datatile==1.0.3
+db-dtypes==1.2.0
+deap==1.4.1
+debugpy==1.8.0
+decorator==5.1.1
+deepdiff==7.0.1
+defusedxml==0.7.1
+deprecation==2.1.0
+descartes==1.1.0
+dill==0.3.8
+dipy==1.9.0
+distlib==0.3.8
+distributed==2023.7.1
+distro==1.9.0
+dm-tree==0.1.8
+docker-pycreds==0.4.0
+docker==7.0.0
+docopt==0.6.2
+docstring-parser==0.15
+docstring-to-markdown==0.15
+docutils==0.21.1
+earthengine-api==0.1.399
+easydict==1.13
+easyocr==1.7.1
+ecos==2.0.13
+einops==0.8.0
+eli5==0.13.0
+emoji==2.11.0
+en-core-web-lg==3.7.1
+en-core-web-sm==3.7.1
+entrypoints==0.4
+ephem==4.1.5
+esda==2.5.1
+essentia==2.1b6.dev1110
+et-xmlfile==1.1.0
+etils==1.6.0
+exceptiongroup==1.2.0
+executing==2.0.1
+explainable-ai-sdk==1.3.3
+fastai==2.7.14
+fastapi==0.108.0
+fastavro==1.9.3
+fastcore==1.5.29
+fastdownload==0.0.7
+fasteners==0.19
+fastjsonschema==2.19.1
+fastprogress==1.0.3
+fastrlock==0.8.2
+fasttext==0.9.2
+feather-format==0.4.1
+featuretools==1.30.0
+filelock==3.13.1
+fiona==1.9.6
+fitter==1.7.0
+flake8==7.0.0
+flashtext==2.7
+flatbuffers==23.5.26
+flax==0.8.2
+folium==0.16.0
+fonttools==4.47.0
+fonttools==4.51.0
+fqdn==1.5.1
+frozendict==2.4.2
+frozenlist==1.4.1
+fsspec==2024.2.0
+fsspec==2024.3.1
+funcy==2.0
+fury==0.10.0
+future==1.0.0
+fuzzywuzzy==0.18.0
+gast==0.5.4
+gatspy==0.3
+gcsfs==2024.2.0
+gensim==4.3.2
+geographiclib==2.0
+geojson==3.1.0
+geopandas==0.14.3
+geoplot==0.5.1
+geopy==2.4.1
+geoviews==1.12.0
+ggplot==0.11.5
+giddy==2.3.5
+gitdb==4.0.11
+google-ai-generativelanguage==0.6.2
+google-api-core==2.11.1
+google-api-core==2.18.0
+google-api-python-client==2.126.0
+google-apitools==0.5.31
+google-auth-httplib2==0.2.0
+google-auth-oauthlib==1.2.0
+google-auth==2.26.1
+google-cloud-aiplatform==0.6.0a1
+google-cloud-artifact-registry==1.10.0
+google-cloud-automl==1.0.1
+google-cloud-bigquery==2.34.4
+google-cloud-bigtable==1.7.3
+google-cloud-core==2.4.1
+google-cloud-datastore==2.19.0
+google-cloud-dlp==3.14.0
+google-cloud-jupyter-config==0.0.5
+google-cloud-language==2.13.3
+google-cloud-monitoring==2.18.0
+google-cloud-pubsub==2.19.0
+google-cloud-pubsublite==1.9.0
+google-cloud-recommendations-ai==0.7.1
+google-cloud-resource-manager==1.11.0
+google-cloud-spanner==3.40.1
+google-cloud-storage==1.44.0
+google-cloud-translate==3.12.1
+google-cloud-videointelligence==2.13.3
+google-cloud-vision==2.8.0
+google-crc32c==1.5.0
+google-generativeai==0.5.1
+google-pasta==0.2.0
+google-resumable-media==2.7.0
+googleapis-common-protos==1.62.0
+gplearn==0.4.2
+gpustat==1.0.0
+gpxpy==1.6.2
+graphviz==0.20.3
+greenlet==3.0.3
+grpc-google-iam-v1==0.12.7
+grpcio-status==1.48.1
+grpcio-status==1.48.2
+grpcio==1.51.1
+grpcio==1.60.0
+gviz-api==1.10.0
+gym-notices==0.0.8
+gym==0.26.2
+gymnasium==0.29.0
+h11==0.14.0
+h2o==3.46.0.1
+h5netcdf==1.3.0
+h5py==3.10.0
+haversine==2.8.1
+hdfs==2.7.3
+hep-ml==0.7.2
+hijri-converter==2.3.1
+hmmlearn==0.3.2
+holidays==0.24
+holoviews==1.18.3
+hpsklearn==0.1.0
+html5lib==1.1
+htmlmin==0.1.12
+httpcore==1.0.5
+httplib2==0.21.0
+httptools==0.6.1
+httpx==0.27.0
+huggingface-hub==0.23.1
+hunspell==0.5.5
+hydra-slayer==0.5.0
+hyperopt==0.2.7
+hypertools==0.8.0
+idna==3.6
+igraph==0.11.4
+imagecodecs==2024.1.1
+imageio==2.33.1
+imbalanced-learn==0.12.2
+imgaug==0.4.0
+importlib-metadata==6.11.0
+importlib-metadata==7.0.1
+importlib-resources==6.1.1
+inequality==1.0.1
+iniconfig==2.0.0
+ipydatawidgets==4.3.5
+ipykernel==6.28.0
+ipyleaflet==0.18.2
+ipympl==0.7.0
+ipython-genutils==0.2.0
+ipython-genutils==0.2.0
+ipython-sql==0.5.0
+ipython==8.20.0
+ipyvolume==0.6.3
+ipyvue==1.11.0
+ipyvuetify==1.9.4
+ipywebrtc==0.6.0
+ipywidgets==7.7.1
+isoduration==20.11.0
+isort==5.13.2
+isoweek==1.3.3
+itsdangerous==2.2.0
+jaraco.classes==3.3.0
+jax-jumpy==1.0.0
+jax==0.4.23
+jaxlib==0.4.23.dev20240116
+jedi==0.19.1
+jeepney==0.8.0
+jieba==0.42.1
+jmespath==1.0.1
+joblib==1.4.0
+json5==0.9.14
+jsonpatch==1.33
+jsonpointer==2.4
+jsonschema-specifications==2023.12.1
+jsonschema==4.20.0
+jupyter-console==6.6.3
+jupyter-events==0.9.0
+jupyter-http-over-ws==0.0.8
+jupyter-lsp==1.5.1
+jupyter-server-mathjax==0.2.6
+jupyter-ydoc==0.2.5
+jupyter_client==7.4.9
+jupyter_client==8.6.0
+jupyter_core==5.7.1
+jupyter_server==2.12.5
+jupyter_server_fileid==0.9.1
+jupyter_server_proxy==4.1.0
+jupyter_server_terminals==0.5.1
+jupyter_server_ydoc==0.8.0
+jupyterlab-lsp==5.1.0
+jupyterlab-widgets==3.0.9
+jupyterlab==4.1.6
+jupyterlab_git==0.44.0
+jupyterlab_pygments==0.3.0
+jupyterlab_server==2.25.2
+jupytext==1.16.0
+kaggle-environments==1.14.3
+kaggle==1.6.12
+kagglehub==0.2.3
+keras-cv==0.8.2
+keras-nlp==0.9.3
+keras-tuner==1.4.6
+keras==3.2.1
+kernels-mixer==0.0.7
+keyring==24.3.0
+keyrings.google-artifactregistry-auth==1.1.2
+kfp-pipeline-spec==0.2.2
+kfp-server-api==2.0.5
+kfp==2.5.0
+kiwisolver==1.4.5
+kmapper==2.0.1
+kmodes==0.12.2
+korean-lunar-calendar==0.3.1
+kornia==0.7.2
+kornia_rs==0.1.3
+kt-legacy==1.0.5
+kubernetes==26.1.0
+langcodes==3.3.0
+langid==1.1.6
+lazy_loader==0.3
+learntools==0.3.4
+leven==1.0.4
+libclang==16.0.6
+libmambapy==1.5.0
+libpysal==4.9.2
+librosa==0.10.1
+lightgbm==4.2.0
+lightning-utilities==0.11.2
+lime==0.2.0.1
+line-profiler==4.1.2
+linkify-it-py==2.0.3
+llvmlite==0.41.1
+llvmlite==0.42.0
+lml==0.1.0
+locket==1.0.0
+loguru==0.7.2
+loralib==0.1.2
+lxml==5.2.1
+lz4==4.3.3
+mamba==1.5.0
+mapclassify==2.6.1
+markdown-it-py==3.0.0
+marshmallow==3.21.1
+matplotlib-inline==0.1.6
+matplotlib-venn==0.11.10
+matplotlib==3.7.5
+matplotlib==3.8.4
+mccabe==0.7.0
+mdit-py-plugins==0.4.0
+mdurl==0.1.2
+memory-profiler==0.61.0
+menuinst==2.0.1
+mercantile==1.2.1
+mgwr==2.2.1
+missingno==0.5.2
+mistune==0.8.4
+mizani==0.11.1
+ml-dtypes==0.2.0
+mlcrate==0.2.0
+mlens==0.2.3
+mlxtend==0.23.1
+mne==1.6.1
+mnist==0.2.2
+momepy==0.7.0
+more-itertools==10.2.0
+mpld3==0.5.10
+mpmath==1.3.0
+msgpack==1.0.7
+multidict==6.0.4
+multimethod==1.10
+multipledispatch==1.0.0
+multiprocess==0.70.16
+munkres==1.1.4
+murmurhash==1.0.10
+mypy-extensions==1.0.0
+namex==0.0.8
+nb-conda-kernels==2.3.1
+nb_conda==2.2.1
+nbclassic==1.0.0
+nbclient==0.5.13
+nbconvert==6.4.5
+nbdime==3.2.0
+nbformat==5.9.2
+ndindex==1.8
+nest-asyncio==1.5.8
+networkx==3.2.1
+nibabel==5.2.1
+nilearn==0.10.4
+ninja==1.11.1.1
+nltk==3.2.4
+nose==1.3.7
+notebook==6.5.4
+notebook==6.5.6
+notebook_executor==0.2
+notebook_shim==0.2.3
+numba==0.58.1
+numba==0.59.1
+numexpr==2.10.0
+numpy==1.26.4
+nvidia-cublas-cu12==12.1.3.1
+nvidia-cuda-cupti-cu12==12.1.105
+nvidia-cuda-nvrtc-cu12==12.1.105
+nvidia-cuda-runtime-cu12==12.1.105
+nvidia-cudnn-cu12==8.9.2.26
+nvidia-cufft-cu12==11.0.2.54
+nvidia-curand-cu12==10.3.2.106
+nvidia-cusolver-cu12==11.4.5.107
+nvidia-cusparse-cu12==12.1.0.106
+nvidia-ml-py==11.495.46
+nvidia-nccl-cu12==2.20.5
+nvidia-nvjitlink-cu12==12.5.40
+nvidia-nvtx-cu12==12.1.105
+nvtx==0.2.10
+oauth2client==4.1.3
+oauthlib==3.2.2
+objsize==0.6.1
+odfpy==1.4.1
+olefile==0.47
+onnx==1.16.0
+opencensus-context==0.1.3
+opencensus==0.11.4
+opencv-contrib-python==4.9.0.80
+opencv-python-headless==4.9.0.80
+opencv-python==4.9.0.80
+openpyxl==3.1.2
+openslide-python==1.3.1
+opentelemetry-api==1.22.0
+opentelemetry-exporter-otlp-proto-common==1.22.0
+opentelemetry-exporter-otlp-proto-grpc==1.22.0
+opentelemetry-exporter-otlp-proto-http==1.22.0
+opentelemetry-exporter-otlp==1.22.0
+opentelemetry-proto==1.22.0
+opentelemetry-sdk==1.22.0
+opentelemetry-semantic-conventions==0.43b0
+opt-einsum==3.3.0
+optax==0.2.2
+optree==0.11.0
+optuna==3.6.1
+orbax-checkpoint==0.5.9
+ordered-set==4.1.0
+orjson==3.9.10
+ortools==9.4.1874
+osmnx==1.9.2
+overrides==7.4.0
+packaging==21.3
+pandas-datareader==0.10.0
+pandas-profiling==3.6.6
+pandas-summary==0.2.0
+pandas==2.1.4
+pandas==2.2.2
+pandasql==0.7.3
+pandocfilters==1.5.0
+panel==1.4.1
+papermill==2.5.0
+param==2.1.0
+parso==0.8.3
+partd==1.4.1
+path.py==12.5.0
+path==16.14.0
+pathos==0.3.2
+pathy==0.10.3
+patsy==0.5.6
+pdf2image==1.17.0
+peft==0.11.1
+pettingzoo==1.24.0
+pexpect==4.8.0
+pexpect==4.9.0
+phik==0.12.4
+pickleshare==0.7.5
+pillow==10.3.0
+pip==23.3.2
+pkgutil_resolve_name==1.3.10
+platformdirs==4.2.0
+plotly-express==0.4.1
+plotly==5.18.0
+plotnine==0.13.4
+pluggy==1.4.0
+pointpats==2.4.0
+polars==0.20.21
+polyglot==16.7.4
+pooch==1.8.1
+pox==0.3.4
+ppca==0.0.4
+ppft==1.7.6.8
+preprocessing==0.1.13
+preshed==3.0.9
+prettytable==3.9.0
+progressbar2==4.4.2
+prometheus-client==0.19.0
+promise==2.3
+prompt-toolkit==3.0.42
+prompt-toolkit==3.0.43
+prophet==1.1.1
+proto-plus==1.23.0
+protobuf==3.20.3
+protobuf==4.21.12
+psutil==5.9.3
+psutil==5.9.7
+ptyprocess==0.7.0
+pudb==2024.1
+pure-eval==0.2.2
+py-cpuinfo==9.0.0
+py-spy==0.3.14
+py4j==0.10.9.7
+pyLDAvis==3.4.1
+pyOpenSSL==23.3.0
+pyaml==23.12.0
+pyarrow-hotfix==0.6
+pyarrow==15.0.2
+pyasn1-modules==0.3.0
+pyasn1==0.5.1
+pybind11==2.12.0
+pyclipper==1.3.0.post5
+pycodestyle==2.11.1
+pycosat==0.6.6
+pycparser==2.21
+pycryptodome==3.20.0
+pyct==0.5.0
+pycuda==2024.1
+pydantic==2.5.3
+pydantic==2.7.0
+pydantic_core==2.14.6
+pydantic_core==2.18.1
+pydegensac==0.1.2
+pydicom==2.4.4
+pydocstyle==6.3.0
+pydot==1.4.2
+pydub==0.25.1
+pyemd==1.0.0
+pyerfa==2.0.1.4
+pyexcel-io==0.6.6
+pyexcel-ods==0.6.0
+pyflakes==3.2.0
+pygltflib==1.16.2
+pykalman==0.9.7
+pylibraft==23.8.0
+pylint==3.1.0
+pymc3==3.11.4
+pymongo==3.13.0
+pynndescent==0.5.12
+pynvml==11.4.1
+pynvrtc==9.2
+pyparsing==3.1.1
+pyparsing==3.1.2
+pypdf==4.2.0
+pyproj==3.6.1
+pysal==24.1
+pyshp==2.3.1
+pytesseract==0.3.10
+pytest==8.1.1
+python-bidi==0.4.2
+python-dateutil==2.9.0.post0
+python-dotenv==1.0.0
+python-json-logger==2.0.7
+python-louvain==0.16
+python-lsp-jsonrpc==1.1.2
+python-lsp-server==1.11.0
+python-slugify==8.0.4
+python-utils==3.8.2
+pythreejs==2.4.2
+pytoolconfig==1.3.1
+pytools==2024.1.1
+pytorch-ignite==0.5.0.post2
+pytorch-lightning==2.2.2
+pytz==2023.3.post1
+pytz==2024.1
+pyu2f==0.1.5
+pyviz_comms==3.0.2
+pyzmq==24.0.1
+pyzmq==25.1.2
+qgrid==1.3.1
+qtconsole==5.5.1
+quantecon==0.7.2
+qudida==0.0.4
+raft-dask==23.8.0
+rasterio==1.3.10
+rasterstats==0.19.0
+ray-cpp==2.9.0
+ray==2.9.0
+referencing==0.32.1
+regex==2023.12.25
+requests-oauthlib==1.3.1
+requests-toolbelt==0.10.1
+requests==2.31.0
+retrying==1.3.3
+retrying==1.3.4
+rfc3339-validator==0.1.4
+rfc3986-validator==0.1.1
+rgf-python==3.12.0
+rich-click==1.7.4
+rich==13.7.0
+rich==13.7.1
+rmm==23.8.0
+rope==1.13.0
+rpds-py==0.16.2
+rsa==4.9
+ruamel-yaml-conda==0.15.100
+ruamel.yaml.clib==0.2.7
+ruamel.yaml==0.17.40
+s2sphere==0.2.5
+s3fs==2024.2.0
+s3transfer==0.6.2
+safetensors==0.4.3
+scattertext==0.1.19
+scikit-image==0.22.0
+scikit-learn-intelex==2024.3.0
+scikit-learn==1.2.2
+scikit-multilearn==0.2.0
+scikit-optimize==0.10.1
+scikit-plot==0.3.7
+scikit-surprise==1.1.3
+scipy==1.11.4
+scipy==1.13.0
+seaborn==0.12.2
+segment_anything==1.0
+segregation==2.5
+semver==3.0.2
+sentencepiece==0.2.0
+sentry-sdk==1.45.0
+setproctitle==1.3.3
+setuptools-git==1.2
+setuptools-scm==8.0.4
+setuptools==69.0.3
+shap==0.44.1
+shapely==2.0.4
+shellingham==1.5.4
+simpervisor==1.0.0
+simplejson==3.19.2
+six==1.16.0
+sklearn-pandas==2.2.0
+slicer==0.0.7
+smart-open==6.4.0
+smmap==5.0.1
+sniffio==1.3.0
+snowballstemmer==2.2.0
+snuggs==1.4.7
+sortedcontainers==2.4.0
+soundfile==0.12.1
+soupsieve==2.5
+soxr==0.3.7
+spacy-legacy==3.0.12
+spacy-loggers==1.0.5
+spacy==3.7.3
+spaghetti==1.7.5.post1
+spectral==0.23.1
+spglm==1.1.0
+sphinx-rtd-theme==0.2.4
+spint==1.0.7
+splot==1.1.5.post1
+spopt==0.6.0
+spreg==1.4.2
+spvcm==0.3.0
+sqlparse==0.4.4
+squarify==0.4.3
+srsly==2.4.8
+stable-baselines3==2.1.0
+stack-data==0.6.2
+stack-data==0.6.3
+stanio==0.5.0
+starlette==0.32.0.post1
+statsmodels==0.14.1
+stemming==1.0.1
+stop-words==2018.7.23
+stopit==1.1.2
+stumpy==1.12.0
+sympy==1.12
+tables==3.9.2
+tabulate==0.9.0
+tangled-up-in-unicode==0.2.0
+tbb==2021.12.0
+tblib==3.0.0
+tenacity==8.2.3
+tensorboard-data-server==0.7.2
+tensorboard-plugin-profile==2.15.0
+tensorboard==2.15.1
+tensorboardX==2.6.2.2
+tensorflow-cloud==0.1.16
+tensorflow-datasets==4.9.4
+tensorflow-decision-forests==1.8.1
+tensorflow-estimator==2.15.0
+tensorflow-hub==0.16.1
+tensorflow-io-gcs-filesystem==0.35.0
+tensorflow-io==0.35.0
+tensorflow-metadata==0.14.0
+tensorflow-probability==0.23.0
+tensorflow-serving-api==2.14.1
+tensorflow-text==2.15.0
+tensorflow-transform==0.14.0
+tensorflow==2.15.0
+tensorstore==0.1.56
+termcolor==2.4.0
+terminado==0.18.0
+testpath==0.6.0
+text-unidecode==1.3
+textblob==0.18.0.post0
+texttable==1.7.0
+tf_keras==2.15.1
+tfp-nightly==0.24.0.dev0
+thinc==8.2.2
+threadpoolctl==3.2.0
+tifffile==2023.12.9
+timm==0.9.16
+tinycss2==1.2.1
+tobler==0.11.2
+tokenizers==0.19.1
+toml==0.10.2
+tomli==2.0.1
+tomlkit==0.12.4
+toolz==0.12.1
+torch==2.3.0
+torchaudio==2.1.2
+torchdata==0.7.1
+torchinfo==1.8.0
+torchmetrics==1.3.2
+torchtext==0.16.2
+torchvision==0.16.2
+tornado==6.3.3
+tqdm==4.66.1
+traceml==1.0.8
+traitlets==5.9.0
+traittypes==0.2.1
+transformers==4.41.0
+treelite-runtime==3.2.0
+treelite==3.2.0
+triton==2.3.0
+truststore==0.8.0
+trx-python==0.2.9
+tsfresh==0.20.2
+typeguard==4.1.5
+typer==0.9.0
+typer==0.9.4
+types-python-dateutil==2.8.19.20240106
+typing-inspect==0.9.0
+typing-utils==0.1.0
+typing_extensions==4.9.0
+tzdata==2023.4
+uc-micro-py==1.0.3
+ucx-py==0.33.0
+ujson==5.9.0
+umap-learn==0.5.6
+unicodedata2==15.1.0
+update-checker==0.18.0
+uri-template==1.3.0
+uritemplate==3.0.1
+urllib3==1.26.18
+urllib3==2.1.0
+urwid==2.6.10
+urwid_readline==0.14
+uvicorn==0.25.0
+uvloop==0.19.0
+vaex-astro==0.9.3
+vaex-core==4.17.1
+vaex-hdf5==0.14.1
+vaex-jupyter==0.8.2
+vaex-ml==0.18.3
+vaex-server==0.9.0
+vaex-viz==0.5.4
+vaex==4.17.0
+vec_noise==1.1.4
+vecstack==0.4.0
+virtualenv==20.21.0
+visions==0.7.5
+vowpalwabbit==9.9.0
+vtk==9.3.0
+wandb==0.16.6
+wasabi==1.1.2
+watchfiles==0.21.0
+wavio==0.0.8
+wcwidth==0.2.13
+weasel==0.3.4
+webcolors==1.13
+webencodings==0.5.1
+websocket-client==1.7.0
+websockets==12.0
+wfdb==4.1.2
+whatthepatch==1.0.5
+wheel==0.42.0
+widgetsnbextension==3.6.6
+witwidget==1.8.1
+woodwork==0.30.0
+wordcloud==1.9.3
+wordsegment==1.3.1
+wrapt==1.14.1
+xarray-einstats==0.7.0
+xarray==2024.3.0
+xformers==0.0.26.post1
+xgboost==2.0.3
+xvfbwrapper==0.2.9
+xxhash==3.4.1
+xyzservices==2024.4.0
+y-py==0.6.2
+yapf==0.40.2
+yarl==1.9.3
+yarl==1.9.4
+ydata-profiling==4.6.4
+yellowbrick==1.5
+ypy-websocket==0.8.4
+zict==3.0.0
+zipp==3.17.0
+zstandard==0.22.0

wandb/run-20240522_112259-4b714brj/files/wandb-metadata.json ADDED

	@@ -0,0 +1,62 @@

+{
+    "os": "Linux-5.15.133+-x86_64-with-glibc2.31",
+    "python": "3.10.13",
+    "heartbeatAt": "2024-05-22T11:23:00.864653",
+    "startedAt": "2024-05-22T11:22:59.891144",
+    "docker": null,
+    "cuda": null,
+    "args": [],
+    "state": "running",
+    "program": "kaggle.ipynb",
+    "codePathLocal": null,
+    "root": "/kaggle/working",
+    "host": "2c1b614ec68f",
+    "username": "root",
+    "executable": "/opt/conda/bin/python3.10",
+    "cpu_count": 2,
+    "cpu_count_logical": 4,
+    "cpu_freq": {
+        "current": 2000.144,
+        "min": 0.0,
+        "max": 0.0
+    },
+    "cpu_freq_per_core": [
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        }
+    ],
+    "disk": {
+        "/": {
+            "total": 8062.387607574463,
+            "used": 5656.321590423584
+        }
+    },
+    "gpu": "Tesla P100-PCIE-16GB",
+    "gpu_count": 1,
+    "gpu_devices": [
+        {
+            "name": "Tesla P100-PCIE-16GB",
+            "memory_total": 17179869184
+        }
+    ],
+    "memory": {
+        "total": 31.357563018798828
+    }
+}

wandb/run-20240522_112259-4b714brj/files/wandb-summary.json ADDED

	@@ -0,0 +1 @@


+{"_wandb": {"runtime": 23}}

wandb/run-20240522_112259-4b714brj/logs/debug-internal.log ADDED

	@@ -0,0 +1,308 @@

+2024-05-22 11:22:59,898 INFO    StreamThr :151 [internal.py:wandb_internal():86] W&B internal server running at pid: 151, started at: 2024-05-22 11:22:59.898127
+2024-05-22 11:22:59,900 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status
+2024-05-22 11:23:00,555 INFO    WriterThread:151 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240522_112259-4b714brj/run-4b714brj.wandb
+2024-05-22 11:23:00,555 DEBUG   SenderThread:151 [sender.py:send():379] send: header
+2024-05-22 11:23:00,561 DEBUG   SenderThread:151 [sender.py:send():379] send: run
+2024-05-22 11:23:00,763 INFO    SenderThread:151 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240522_112259-4b714brj/files
+2024-05-22 11:23:00,764 INFO    SenderThread:151 [sender.py:_start_run_threads():1124] run started: 4b714brj with start time 1716376979.899342
+2024-05-22 11:23:00,767 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: check_version
+2024-05-22 11:23:00,767 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: check_version
+2024-05-22 11:23:00,842 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: run_start
+2024-05-22 11:23:00,854 DEBUG   HandlerThread:151 [system_info.py:__init__():26] System info init
+2024-05-22 11:23:00,854 DEBUG   HandlerThread:151 [system_info.py:__init__():41] System info init done
+2024-05-22 11:23:00,854 INFO    HandlerThread:151 [system_monitor.py:start():194] Starting system monitor
+2024-05-22 11:23:00,854 INFO    SystemMonitor:151 [system_monitor.py:_start():158] Starting system asset monitoring threads
+2024-05-22 11:23:00,854 INFO    HandlerThread:151 [system_monitor.py:probe():214] Collecting system info
+2024-05-22 11:23:00,855 INFO    SystemMonitor:151 [interfaces.py:start():190] Started cpu monitoring
+2024-05-22 11:23:00,855 INFO    SystemMonitor:151 [interfaces.py:start():190] Started disk monitoring
+2024-05-22 11:23:00,856 INFO    SystemMonitor:151 [interfaces.py:start():190] Started gpu monitoring
+2024-05-22 11:23:00,858 INFO    SystemMonitor:151 [interfaces.py:start():190] Started memory monitoring
+2024-05-22 11:23:00,858 INFO    SystemMonitor:151 [interfaces.py:start():190] Started network monitoring
+2024-05-22 11:23:00,864 DEBUG   HandlerThread:151 [system_info.py:probe():150] Probing system
+2024-05-22 11:23:00,867 DEBUG   HandlerThread:151 [gitlib.py:_init_repo():56] git repository is invalid
+2024-05-22 11:23:00,867 DEBUG   HandlerThread:151 [system_info.py:probe():198] Probing system done
+2024-05-22 11:23:00,867 DEBUG   HandlerThread:151 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-05-22T11:23:00.864653', 'startedAt': '2024-05-22T11:22:59.891144', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '2c1b614ec68f', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.144, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.144, 'min': 0.0, 'max': 0.0}, {'current': 2000.144, 'min': 0.0, 'max': 0.0}, {'current': 2000.144, 'min': 0.0, 'max': 0.0}, {'current': 2000.144, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5656.321590423584}}, 'gpu': 'Tesla P100-PCIE-16GB', 'gpu_count': 1, 'gpu_devices': [{'name': 'Tesla P100-PCIE-16GB', 'memory_total': 17179869184}], 'memory': {'total': 31.357563018798828}}
+2024-05-22 11:23:00,867 INFO    HandlerThread:151 [system_monitor.py:probe():224] Finished collecting system info
+2024-05-22 11:23:00,867 INFO    HandlerThread:151 [system_monitor.py:probe():227] Publishing system info
+2024-05-22 11:23:00,867 DEBUG   HandlerThread:151 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
+2024-05-22 11:23:01,766 INFO    Thread-12 :151 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/conda-environment.yaml
+2024-05-22 11:23:15,885 ERROR   HandlerThread:151 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
+Traceback (most recent call last):
+  File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
+    subprocess.call(
+  File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
+    return p.wait(timeout=timeout)
+  File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
+    return self._wait(timeout=timeout)
+  File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
+    raise TimeoutExpired(self.args, timeout)
+subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
+2024-05-22 11:23:15,887 DEBUG   HandlerThread:151 [system_info.py:_save_conda():222] Saving conda packages done
+2024-05-22 11:23:15,888 INFO    HandlerThread:151 [system_monitor.py:probe():229] Finished publishing system info
+2024-05-22 11:23:15,896 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:15,896 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: keepalive
+2024-05-22 11:23:15,896 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:15,896 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: keepalive
+2024-05-22 11:23:15,896 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:15,896 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: keepalive
+2024-05-22 11:23:15,897 DEBUG   SenderThread:151 [sender.py:send():379] send: files
+2024-05-22 11:23:15,897 INFO    SenderThread:151 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
+2024-05-22 11:23:16,228 INFO    wandb-upload_0:151 [upload_job.py:push():131] Uploaded file /tmp/tmpbupdg2uiwandb/jmuw7ldk-wandb-metadata.json
+2024-05-22 11:23:16,769 INFO    Thread-12 :151 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/wandb-metadata.json
+2024-05-22 11:23:16,882 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: python_packages
+2024-05-22 11:23:16,882 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: python_packages
+2024-05-22 11:23:16,885 DEBUG   SenderThread:151 [sender.py:send():379] send: telemetry
+2024-05-22 11:23:16,896 DEBUG   SenderThread:151 [sender.py:send():379] send: config
+2024-05-22 11:23:16,898 DEBUG   SenderThread:151 [sender.py:send():379] send: metric
+2024-05-22 11:23:16,899 DEBUG   SenderThread:151 [sender.py:send():379] send: telemetry
+2024-05-22 11:23:16,899 DEBUG   SenderThread:151 [sender.py:send():379] send: metric
+2024-05-22 11:23:16,899 WARNING SenderThread:151 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
+2024-05-22 11:23:16,899 DEBUG   SenderThread:151 [sender.py:send():379] send: telemetry
+2024-05-22 11:23:16,903 DEBUG   SenderThread:151 [sender.py:send():379] send: telemetry
+2024-05-22 11:23:16,903 DEBUG   SenderThread:151 [sender.py:send():379] send: config
+2024-05-22 11:23:16,904 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: stop_status
+2024-05-22 11:23:16,904 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: stop_status
+2024-05-22 11:23:16,907 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: internal_messages
+2024-05-22 11:23:17,769 INFO    Thread-12 :151 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/output.log
+2024-05-22 11:23:17,770 INFO    Thread-12 :151 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/requirements.txt
+2024-05-22 11:23:17,858 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: log_artifact
+2024-05-22 11:23:17,858 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: log_artifact
+2024-05-22 11:23:18,625 INFO    wandb-upload_1:151 [upload_job.py:push():89] Uploaded file /tmp/tmpfgt5iaju/adapter_config.json
+2024-05-22 11:23:18,645 INFO    wandb-upload_3:151 [upload_job.py:push():89] Uploaded file /tmp/tmpfgt5iaju/model_architecture.txt
+2024-05-22 11:23:18,661 INFO    wandb-upload_0:151 [upload_job.py:push():89] Uploaded file /tmp/tmpfgt5iaju/README.md
+2024-05-22 11:23:19,067 INFO    wandb-upload_2:151 [upload_job.py:push():89] Uploaded file /tmp/tmpfgt5iaju/adapter_model.safetensors
+2024-05-22 11:23:19,580 INFO    SenderThread:151 [sender.py:send_request_log_artifact():1456] logged artifact model-4b714brj - {'id': 'QXJ0aWZhY3Q6ODQ2MDMyMjU0', 'state': 'PENDING', 'artifactSequence': {'id': 'QXJ0aWZhY3RDb2xsZWN0aW9uOjE3Nzk2OTgzMA==', 'latestArtifact': None}}
+2024-05-22 11:23:19,770 INFO    Thread-12 :151 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/output.log
+2024-05-22 11:23:20,547 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: pause
+2024-05-22 11:23:20,547 INFO    HandlerThread:151 [handler.py:handle_request_pause():708] stopping system metrics thread
+2024-05-22 11:23:20,547 INFO    HandlerThread:151 [system_monitor.py:finish():203] Stopping system monitor
+2024-05-22 11:23:20,548 DEBUG   SystemMonitor:151 [system_monitor.py:_start():172] Starting system metrics aggregation loop
+2024-05-22 11:23:20,548 DEBUG   SystemMonitor:151 [system_monitor.py:_start():179] Finished system metrics aggregation loop
+2024-05-22 11:23:20,548 DEBUG   SystemMonitor:151 [system_monitor.py:_start():183] Publishing last batch of metrics
+2024-05-22 11:23:20,548 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined cpu monitor
+2024-05-22 11:23:20,549 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined disk monitor
+2024-05-22 11:23:20,555 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined gpu monitor
+2024-05-22 11:23:20,555 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined memory monitor
+2024-05-22 11:23:20,555 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined network monitor
+2024-05-22 11:23:21,581 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:26,583 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:31,589 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:31,775 INFO    Thread-12 :151 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/config.yaml
+2024-05-22 11:23:31,883 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: stop_status
+2024-05-22 11:23:31,883 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: stop_status
+2024-05-22 11:23:31,886 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: internal_messages
+2024-05-22 11:23:36,999 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:42,000 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:45,679 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: resume
+2024-05-22 11:23:45,680 INFO    HandlerThread:151 [handler.py:handle_request_resume():699] starting system metrics thread
+2024-05-22 11:23:45,680 INFO    HandlerThread:151 [system_monitor.py:start():194] Starting system monitor
+2024-05-22 11:23:45,680 INFO    SystemMonitor:151 [system_monitor.py:_start():158] Starting system asset monitoring threads
+2024-05-22 11:23:45,681 INFO    SystemMonitor:151 [interfaces.py:start():190] Started cpu monitoring
+2024-05-22 11:23:45,681 INFO    SystemMonitor:151 [interfaces.py:start():190] Started disk monitoring
+2024-05-22 11:23:45,682 INFO    SystemMonitor:151 [interfaces.py:start():190] Started gpu monitoring
+2024-05-22 11:23:45,685 INFO    SystemMonitor:151 [interfaces.py:start():190] Started memory monitoring
+2024-05-22 11:23:45,685 INFO    SystemMonitor:151 [interfaces.py:start():190] Started network monitoring
+2024-05-22 11:23:45,720 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: pause
+2024-05-22 11:23:45,720 INFO    HandlerThread:151 [handler.py:handle_request_pause():708] stopping system metrics thread
+2024-05-22 11:23:45,720 INFO    HandlerThread:151 [system_monitor.py:finish():203] Stopping system monitor
+2024-05-22 11:23:45,721 DEBUG   SystemMonitor:151 [system_monitor.py:_start():172] Starting system metrics aggregation loop
+2024-05-22 11:23:45,721 DEBUG   SystemMonitor:151 [system_monitor.py:_start():179] Finished system metrics aggregation loop
+2024-05-22 11:23:45,721 DEBUG   SystemMonitor:151 [system_monitor.py:_start():183] Publishing last batch of metrics
+2024-05-22 11:23:45,722 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined cpu monitor
+2024-05-22 11:23:45,722 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined disk monitor
+2024-05-22 11:23:45,727 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined gpu monitor
+2024-05-22 11:23:45,727 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined memory monitor
+2024-05-22 11:23:45,728 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined network monitor
+2024-05-22 11:23:45,728 DEBUG   SenderThread:151 [sender.py:send():379] send: stats
+2024-05-22 11:23:46,883 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: stop_status
+2024-05-22 11:23:46,883 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: internal_messages
+2024-05-22 11:23:46,884 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: stop_status
+2024-05-22 11:23:47,780 INFO    Thread-12 :151 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/output.log
+2024-05-22 11:23:47,995 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:52,144 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: resume
+2024-05-22 11:23:52,144 INFO    HandlerThread:151 [handler.py:handle_request_resume():699] starting system metrics thread
+2024-05-22 11:23:52,144 INFO    HandlerThread:151 [system_monitor.py:start():194] Starting system monitor
+2024-05-22 11:23:52,144 INFO    SystemMonitor:151 [system_monitor.py:_start():158] Starting system asset monitoring threads
+2024-05-22 11:23:52,145 INFO    SystemMonitor:151 [interfaces.py:start():190] Started cpu monitoring
+2024-05-22 11:23:52,145 INFO    SystemMonitor:151 [interfaces.py:start():190] Started disk monitoring
+2024-05-22 11:23:52,147 INFO    SystemMonitor:151 [interfaces.py:start():190] Started gpu monitoring
+2024-05-22 11:23:52,147 INFO    SystemMonitor:151 [interfaces.py:start():190] Started memory monitoring
+2024-05-22 11:23:52,148 INFO    SystemMonitor:151 [interfaces.py:start():190] Started network monitoring
+2024-05-22 11:23:52,185 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: pause
+2024-05-22 11:23:52,185 INFO    HandlerThread:151 [handler.py:handle_request_pause():708] stopping system metrics thread
+2024-05-22 11:23:52,185 INFO    HandlerThread:151 [system_monitor.py:finish():203] Stopping system monitor
+2024-05-22 11:23:52,185 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined cpu monitor
+2024-05-22 11:23:52,185 DEBUG   SystemMonitor:151 [system_monitor.py:_start():172] Starting system metrics aggregation loop
+2024-05-22 11:23:52,186 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined disk monitor
+2024-05-22 11:23:52,186 DEBUG   SystemMonitor:151 [system_monitor.py:_start():179] Finished system metrics aggregation loop
+2024-05-22 11:23:52,186 DEBUG   SystemMonitor:151 [system_monitor.py:_start():183] Publishing last batch of metrics
+2024-05-22 11:23:52,191 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined gpu monitor
+2024-05-22 11:23:52,191 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined memory monitor
+2024-05-22 11:23:52,192 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined network monitor
+2024-05-22 11:23:52,192 DEBUG   SenderThread:151 [sender.py:send():379] send: stats
+2024-05-22 11:23:53,193 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:23:58,194 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:24:01,883 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: stop_status
+2024-05-22 11:24:01,883 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: stop_status
+2024-05-22 11:24:01,885 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: internal_messages
+2024-05-22 11:24:02,823 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: resume
+2024-05-22 11:24:02,823 INFO    HandlerThread:151 [handler.py:handle_request_resume():699] starting system metrics thread
+2024-05-22 11:24:02,823 INFO    HandlerThread:151 [system_monitor.py:start():194] Starting system monitor
+2024-05-22 11:24:02,824 INFO    SystemMonitor:151 [system_monitor.py:_start():158] Starting system asset monitoring threads
+2024-05-22 11:24:02,828 INFO    SystemMonitor:151 [interfaces.py:start():190] Started cpu monitoring
+2024-05-22 11:24:02,829 INFO    SystemMonitor:151 [interfaces.py:start():190] Started disk monitoring
+2024-05-22 11:24:02,831 INFO    SystemMonitor:151 [interfaces.py:start():190] Started gpu monitoring
+2024-05-22 11:24:02,835 INFO    SystemMonitor:151 [interfaces.py:start():190] Started memory monitoring
+2024-05-22 11:24:02,836 INFO    SystemMonitor:151 [interfaces.py:start():190] Started network monitoring
+2024-05-22 11:24:04,031 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:24:05,597 DEBUG   SenderThread:151 [sender.py:send():379] send: config
+2024-05-22 11:24:05,599 DEBUG   SenderThread:151 [sender.py:send():379] send: metric
+2024-05-22 11:24:05,599 DEBUG   SenderThread:151 [sender.py:send():379] send: metric
+2024-05-22 11:24:05,599 WARNING SenderThread:151 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
+2024-05-22 11:24:05,602 DEBUG   SenderThread:151 [sender.py:send():379] send: config
+2024-05-22 11:24:06,395 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: log_artifact
+2024-05-22 11:24:06,395 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: log_artifact
+2024-05-22 11:24:06,657 INFO    SenderThread:151 [sender.py:send_request_log_artifact():1456] logged artifact model-4b714brj - {'id': 'QXJ0aWZhY3Q6ODQ2MDMyMjU0', 'state': 'COMMITTED', 'artifactSequence': {'id': 'QXJ0aWZhY3RDb2xsZWN0aW9uOjE3Nzk2OTgzMA==', 'latestArtifact': {'id': 'QXJ0aWZhY3Q6ODQ2MDMyMjU0', 'versionIndex': 0}}}
+2024-05-22 11:24:06,748 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: pause
+2024-05-22 11:24:06,748 INFO    HandlerThread:151 [handler.py:handle_request_pause():708] stopping system metrics thread
+2024-05-22 11:24:06,748 INFO    HandlerThread:151 [system_monitor.py:finish():203] Stopping system monitor
+2024-05-22 11:24:06,748 DEBUG   SystemMonitor:151 [system_monitor.py:_start():172] Starting system metrics aggregation loop
+2024-05-22 11:24:06,749 DEBUG   SystemMonitor:151 [system_monitor.py:_start():179] Finished system metrics aggregation loop
+2024-05-22 11:24:06,749 DEBUG   SystemMonitor:151 [system_monitor.py:_start():183] Publishing last batch of metrics
+2024-05-22 11:24:06,750 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined cpu monitor
+2024-05-22 11:24:06,750 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined disk monitor
+2024-05-22 11:24:06,755 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined gpu monitor
+2024-05-22 11:24:06,756 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined memory monitor
+2024-05-22 11:24:06,756 INFO    HandlerThread:151 [interfaces.py:finish():202] Joined network monitor
+2024-05-22 11:24:06,756 DEBUG   SenderThread:151 [sender.py:send():379] send: stats
+2024-05-22 11:24:09,757 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:24:14,758 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:24:16,883 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: stop_status
+2024-05-22 11:24:16,884 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: internal_messages
+2024-05-22 11:24:16,884 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: stop_status
+2024-05-22 11:24:20,045 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:24:21,664 DEBUG   SenderThread:151 [sender.py:send():379] send: exit
+2024-05-22 11:24:21,664 INFO    SenderThread:151 [sender.py:send_exit():586] handling exit code: 0
+2024-05-22 11:24:21,664 INFO    SenderThread:151 [sender.py:send_exit():588] handling runtime: 23
+2024-05-22 11:24:21,666 INFO    SenderThread:151 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
+2024-05-22 11:24:21,666 INFO    SenderThread:151 [sender.py:send_exit():594] send defer
+2024-05-22 11:24:21,666 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,667 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 0
+2024-05-22 11:24:21,667 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,667 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 0
+2024-05-22 11:24:21,667 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 1
+2024-05-22 11:24:21,667 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,667 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 1
+2024-05-22 11:24:21,668 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,668 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 1
+2024-05-22 11:24:21,668 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 2
+2024-05-22 11:24:21,668 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,668 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 2
+2024-05-22 11:24:21,668 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,668 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 2
+2024-05-22 11:24:21,668 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 3
+2024-05-22 11:24:21,668 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,668 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 3
+2024-05-22 11:24:21,669 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,669 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 3
+2024-05-22 11:24:21,669 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 4
+2024-05-22 11:24:21,669 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,669 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 4
+2024-05-22 11:24:21,669 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,669 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 4
+2024-05-22 11:24:21,669 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 5
+2024-05-22 11:24:21,669 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,669 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 5
+2024-05-22 11:24:21,670 DEBUG   SenderThread:151 [sender.py:send():379] send: summary
+2024-05-22 11:24:21,670 INFO    SenderThread:151 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
+2024-05-22 11:24:21,670 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,670 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 5
+2024-05-22 11:24:21,670 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 6
+2024-05-22 11:24:21,670 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,670 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 6
+2024-05-22 11:24:21,671 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,671 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 6
+2024-05-22 11:24:21,676 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: status_report
+2024-05-22 11:24:21,793 INFO    Thread-12 :151 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/wandb-summary.json
+2024-05-22 11:24:21,885 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 7
+2024-05-22 11:24:21,885 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:21,885 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 7
+2024-05-22 11:24:21,885 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:21,885 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 7
+2024-05-22 11:24:22,664 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: poll_exit
+2024-05-22 11:24:22,794 INFO    Thread-12 :151 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/config.yaml
+2024-05-22 11:24:23,767 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 8
+2024-05-22 11:24:23,768 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: poll_exit
+2024-05-22 11:24:23,768 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:23,768 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 8
+2024-05-22 11:24:23,769 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:23,769 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 8
+2024-05-22 11:24:23,769 INFO    SenderThread:151 [job_builder.py:build():318] Attempting to build job artifact
+2024-05-22 11:24:23,772 INFO    SenderThread:151 [job_builder.py:_get_source_type():466] no source found
+2024-05-22 11:24:23,772 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 9
+2024-05-22 11:24:23,773 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:23,773 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 9
+2024-05-22 11:24:23,773 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:23,773 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 9
+2024-05-22 11:24:23,773 INFO    SenderThread:151 [dir_watcher.py:finish():358] shutting down directory watcher
+2024-05-22 11:24:23,794 INFO    Thread-12 :151 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/output.log
+2024-05-22 11:24:23,795 INFO    SenderThread:151 [dir_watcher.py:finish():388] scan: /kaggle/working/wandb/run-20240522_112259-4b714brj/files
+2024-05-22 11:24:23,795 INFO    SenderThread:151 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/output.log output.log
+2024-05-22 11:24:23,795 INFO    SenderThread:151 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/config.yaml config.yaml
+2024-05-22 11:24:23,800 INFO    SenderThread:151 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/wandb-metadata.json wandb-metadata.json
+2024-05-22 11:24:23,800 INFO    SenderThread:151 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/conda-environment.yaml conda-environment.yaml
+2024-05-22 11:24:23,801 INFO    SenderThread:151 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/wandb-summary.json wandb-summary.json
+2024-05-22 11:24:23,803 INFO    SenderThread:151 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240522_112259-4b714brj/files/requirements.txt requirements.txt
+2024-05-22 11:24:23,807 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 10
+2024-05-22 11:24:23,807 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:23,807 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 10
+2024-05-22 11:24:23,811 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:23,811 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 10
+2024-05-22 11:24:23,811 INFO    SenderThread:151 [file_pusher.py:finish():172] shutting down file pusher
+2024-05-22 11:24:24,020 INFO    wandb-upload_1:151 [upload_job.py:push():131] Uploaded file /kaggle/working/wandb/run-20240522_112259-4b714brj/files/output.log
+2024-05-22 11:24:24,062 INFO    wandb-upload_3:151 [upload_job.py:push():131] Uploaded file /kaggle/working/wandb/run-20240522_112259-4b714brj/files/wandb-summary.json
+2024-05-22 11:24:24,063 INFO    wandb-upload_0:151 [upload_job.py:push():131] Uploaded file /kaggle/working/wandb/run-20240522_112259-4b714brj/files/config.yaml
+2024-05-22 11:24:24,065 INFO    wandb-upload_2:151 [upload_job.py:push():131] Uploaded file /kaggle/working/wandb/run-20240522_112259-4b714brj/files/requirements.txt
+2024-05-22 11:24:24,265 INFO    Thread-11 (_thread_body):151 [sender.py:transition_state():614] send defer: 11
+2024-05-22 11:24:24,266 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:24,266 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 11
+2024-05-22 11:24:24,267 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:24,267 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 11
+2024-05-22 11:24:24,267 INFO    SenderThread:151 [file_pusher.py:join():178] waiting for file pusher
+2024-05-22 11:24:24,267 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 12
+2024-05-22 11:24:24,267 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:24,267 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 12
+2024-05-22 11:24:24,267 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:24,268 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 12
+2024-05-22 11:24:24,268 INFO    SenderThread:151 [file_stream.py:finish():614] file stream finish called
+2024-05-22 11:24:24,475 INFO    SenderThread:151 [file_stream.py:finish():618] file stream finish is done
+2024-05-22 11:24:24,475 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 13
+2024-05-22 11:24:24,475 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:24,475 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 13
+2024-05-22 11:24:24,475 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:24,476 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 13
+2024-05-22 11:24:24,476 INFO    SenderThread:151 [sender.py:transition_state():614] send defer: 14
+2024-05-22 11:24:24,476 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: defer
+2024-05-22 11:24:24,476 INFO    HandlerThread:151 [handler.py:handle_request_defer():172] handle defer: 14
+2024-05-22 11:24:24,476 DEBUG   SenderThread:151 [sender.py:send():379] send: final
+2024-05-22 11:24:24,477 DEBUG   SenderThread:151 [sender.py:send():379] send: footer
+2024-05-22 11:24:24,477 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: defer
+2024-05-22 11:24:24,477 INFO    SenderThread:151 [sender.py:send_request_defer():610] handle sender defer: 14
+2024-05-22 11:24:24,478 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: poll_exit
+2024-05-22 11:24:24,479 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: poll_exit
+2024-05-22 11:24:24,479 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: poll_exit
+2024-05-22 11:24:24,479 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: poll_exit
+2024-05-22 11:24:24,480 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: server_info
+2024-05-22 11:24:24,480 DEBUG   SenderThread:151 [sender.py:send_request():406] send_request: server_info
+2024-05-22 11:24:24,483 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: get_summary
+2024-05-22 11:24:24,484 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: sampled_history
+2024-05-22 11:24:24,484 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: internal_messages
+2024-05-22 11:24:24,548 INFO    MainThread:151 [wandb_run.py:_footer_history_summary_info():3936] rendering history
+2024-05-22 11:24:24,548 INFO    MainThread:151 [wandb_run.py:_footer_history_summary_info():3968] rendering summary
+2024-05-22 11:24:24,549 INFO    MainThread:151 [wandb_run.py:_footer_sync_info():3895] logging synced files
+2024-05-22 11:24:24,549 DEBUG   HandlerThread:151 [handler.py:handle_request():146] handle_request: shutdown
+2024-05-22 11:24:24,549 INFO    HandlerThread:151 [handler.py:finish():866] shutting down handler
+2024-05-22 11:24:25,480 INFO    WriterThread:151 [datastore.py:close():296] close: /kaggle/working/wandb/run-20240522_112259-4b714brj/run-4b714brj.wandb
+2024-05-22 11:24:25,548 INFO    SenderThread:151 [sender.py:finish():1546] shutting down sender
+2024-05-22 11:24:25,548 INFO    SenderThread:151 [file_pusher.py:finish():172] shutting down file pusher
+2024-05-22 11:24:25,549 INFO    SenderThread:151 [file_pusher.py:join():178] waiting for file pusher

wandb/run-20240522_112259-4b714brj/logs/debug.log ADDED

	@@ -0,0 +1,48 @@

+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Current SDK version is 0.16.6
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Configure stats pid to 34
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {}
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {'api_key': '***REDACTED***'}
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_init.py:_log_setup():521] Logging user logs to /kaggle/working/wandb/run-20240522_112259-4b714brj/logs/debug.log
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_init.py:_log_setup():522] Logging internal logs to /kaggle/working/wandb/run-20240522_112259-4b714brj/logs/debug-internal.log
+2024-05-22 11:22:59,893 INFO    MainThread:34 [wandb_init.py:_jupyter_setup():467] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x78d7123ff250>
+2024-05-22 11:22:59,894 INFO    MainThread:34 [wandb_init.py:init():561] calling init triggers
+2024-05-22 11:22:59,894 INFO    MainThread:34 [wandb_init.py:init():568] wandb.init called with sweep_config: {}
+config: {}
+2024-05-22 11:22:59,894 INFO    MainThread:34 [wandb_init.py:init():611] starting backend
+2024-05-22 11:22:59,894 INFO    MainThread:34 [wandb_init.py:init():615] setting up manager
+2024-05-22 11:22:59,896 INFO    MainThread:34 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2024-05-22 11:22:59,899 INFO    MainThread:34 [wandb_init.py:init():623] backend started and connected
+2024-05-22 11:22:59,913 INFO    MainThread:34 [wandb_run.py:_label_probe_notebook():1299] probe notebook
+2024-05-22 11:23:00,554 INFO    MainThread:34 [wandb_init.py:init():715] updated telemetry
+2024-05-22 11:23:00,559 INFO    MainThread:34 [wandb_init.py:init():748] communicating run to backend with 90.0 second timeout
+2024-05-22 11:23:00,766 INFO    MainThread:34 [wandb_run.py:_on_init():2357] communicating current version
+2024-05-22 11:23:00,835 INFO    MainThread:34 [wandb_run.py:_on_init():2366] got version response upgrade_message: "wandb version 0.17.0 is available!  To upgrade, please run:\n $ pip install wandb --upgrade"
+2024-05-22 11:23:00,836 INFO    MainThread:34 [wandb_init.py:init():799] starting run threads in backend
+2024-05-22 11:23:16,882 INFO    MainThread:34 [wandb_run.py:_console_start():2335] atexit reg
+2024-05-22 11:23:16,883 INFO    MainThread:34 [wandb_run.py:_redirect():2190] redirect: wrap_raw
+2024-05-22 11:23:16,883 INFO    MainThread:34 [wandb_run.py:_redirect():2255] Wrapping output streams.
+2024-05-22 11:23:16,883 INFO    MainThread:34 [wandb_run.py:_redirect():2280] Redirects installed.
+2024-05-22 11:23:16,885 INFO    MainThread:34 [wandb_init.py:init():842] run started, returning control to user process
+2024-05-22 11:23:16,891 INFO    MainThread:34 [wandb_run.py:_config_callback():1347] config_cb None None {'peft_config': {'default': {'peft_type': <PeftType.LORA: 'LORA'>, 'auto_mapping': None, 'base_model_name_or_path': 'core42/jais-13b', 'revision': None, 'task_type': 'CAUSAL_LM', 'inference_mode': False, 'r': 16, 'target_modules': {'c_attn'}, 'lora_alpha': 32, 'lora_dropout': 0.05, 'fan_in_fan_out': False, 'bias': 'none', 'use_rslora': False, 'modules_to_save': None, 'init_lora_weights': True, 'layers_to_transform': None, 'layers_pattern': None, 'rank_pattern': {}, 'alpha_pattern': {}, 'megatron_config': None, 'megatron_core': 'megatron.core', 'loftq_config': {}, 'use_dora': False, 'layer_replication': None}}, 'vocab_size': 84992, 'n_positions': 2048, 'n_embd': 5120, 'n_layer': 40, 'n_head': 40, 'n_inner': 13653, 'activation_function': 'swiglu', 'resid_pdrop': 0.0, 'embd_pdrop': 0.0, 'attn_pdrop': 0.0, 'layer_norm_epsilon': 1e-05, 'initializer_range': 0.02, 'scale_attn_weights': True, 'use_cache': False, 'scale_attn_by_inverse_layer_idx': False, 'reorder_and_upcast_attn': False, 'bos_token_id': 0, 'eos_token_id': 0, 'position_embedding_type': 'alibi', 'width_scale': 0.11100000000000002, 'embeddings_scale': 14.6, 'scale_qk_dot_by_d': True, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 
'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['JAISLMHeadModel'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'pad_token_id': 0, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'core42/jais-13b', 'transformers_version': '4.41.0', 'auto_map': {'AutoConfig': 'core42/jais-13b--configuration_jais.JAISConfig', 'AutoModel': 'core42/jais-13b--modeling_jais.JAISModel', 'AutoModelForCausalLM': 'core42/jais-13b--modeling_jais.JAISLMHeadModel', 'AutoModelForQuestionAnswering': 'core42/jais-13b--modeling_jais.JAISForQuestionAnswering', 'AutoModelForSequenceClassification': 'core42/jais-13b--modeling_jais.JAISForSequenceClassification', 'AutoModelForTokenClassification': 'core42/jais-13b--modeling_jais.JAISForTokenClassification'}, 'model_type': 'jais', 'quantization_config': {'quant_method': 'QuantizationMethod.BITS_AND_BYTES', '_load_in_8bit': False, '_load_in_4bit': True, 'llm_int8_threshold': 6.0, 'llm_int8_skip_modules': None, 'llm_int8_enable_fp32_cpu_offload': False, 'llm_int8_has_fp16_weight': False, 'bnb_4bit_quant_type': 'nf4', 'bnb_4bit_use_double_quant': False, 'bnb_4bit_compute_dtype': 'bfloat16', 'bnb_4bit_quant_storage': 'uint8', 'load_in_4bit': True, 'load_in_8bit': False}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': False, 'do_predict': False, 'eval_strategy': 'no', 'prediction_loss_only': False, 'per_device_train_batch_size': 8, 'per_device_eval_batch_size': 8, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 
'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.0, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 2, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 0, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/May22_11-21-59_2c1b614ec68f', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': 4, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': True, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': False, 'metric_for_best_model': None, 'greater_is_better': None, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 
'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': True, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False}
+2024-05-22 11:23:16,902 INFO    MainThread:34 [wandb_config.py:__setitem__():151] config set model/num_parameters = 13033919160 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x78d6d5269d80>>
+2024-05-22 11:23:16,903 INFO    MainThread:34 [wandb_run.py:_config_callback():1347] config_cb model/num_parameters 13033919160 None
+2024-05-22 11:23:20,546 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 11:23:20,547 INFO    MainThread:34 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 11:23:45,679 INFO    MainThread:34 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 11:23:45,683 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 11:23:45,683 INFO    MainThread:34 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 11:23:52,143 INFO    MainThread:34 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 11:23:52,145 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 11:23:52,145 INFO    MainThread:34 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 11:24:02,823 INFO    MainThread:34 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 11:24:05,592 INFO    MainThread:34 [wandb_run.py:_config_callback():1347] config_cb None None {'peft_config': {'default': {'peft_type': <PeftType.LORA: 'LORA'>, 'auto_mapping': None, 'base_model_name_or_path': 'core42/jais-13b', 'revision': None, 'task_type': 'CAUSAL_LM', 'inference_mode': False, 'r': 16, 'target_modules': {'c_attn'}, 'lora_alpha': 32, 'lora_dropout': 0.05, 'fan_in_fan_out': False, 'bias': 'none', 'use_rslora': False, 'modules_to_save': None, 'init_lora_weights': True, 'layers_to_transform': None, 'layers_pattern': None, 'rank_pattern': {}, 'alpha_pattern': {}, 'megatron_config': None, 'megatron_core': 'megatron.core', 'loftq_config': {}, 'use_dora': False, 'layer_replication': None}}, 'vocab_size': 84992, 'n_positions': 2048, 'n_embd': 5120, 'n_layer': 40, 'n_head': 40, 'n_inner': 13653, 'activation_function': 'swiglu', 'resid_pdrop': 0.0, 'embd_pdrop': 0.0, 'attn_pdrop': 0.0, 'layer_norm_epsilon': 1e-05, 'initializer_range': 0.02, 'scale_attn_weights': True, 'use_cache': False, 'scale_attn_by_inverse_layer_idx': False, 'reorder_and_upcast_attn': False, 'bos_token_id': 0, 'eos_token_id': 0, 'position_embedding_type': 'alibi', 'width_scale': 0.11100000000000002, 'embeddings_scale': 14.6, 'scale_qk_dot_by_d': True, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 
'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['JAISLMHeadModel'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'pad_token_id': 0, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'core42/jais-13b', 'transformers_version': '4.41.0', 'auto_map': {'AutoConfig': 'core42/jais-13b--configuration_jais.JAISConfig', 'AutoModel': 'core42/jais-13b--modeling_jais.JAISModel', 'AutoModelForCausalLM': 'core42/jais-13b--modeling_jais.JAISLMHeadModel', 'AutoModelForQuestionAnswering': 'core42/jais-13b--modeling_jais.JAISForQuestionAnswering', 'AutoModelForSequenceClassification': 'core42/jais-13b--modeling_jais.JAISForSequenceClassification', 'AutoModelForTokenClassification': 'core42/jais-13b--modeling_jais.JAISForTokenClassification'}, 'model_type': 'jais', 'quantization_config': {'quant_method': 'QuantizationMethod.BITS_AND_BYTES', '_load_in_8bit': False, '_load_in_4bit': True, 'llm_int8_threshold': 6.0, 'llm_int8_skip_modules': None, 'llm_int8_enable_fp32_cpu_offload': False, 'llm_int8_has_fp16_weight': False, 'bnb_4bit_quant_type': 'nf4', 'bnb_4bit_use_double_quant': False, 'bnb_4bit_compute_dtype': 'bfloat16', 'bnb_4bit_quant_storage': 'uint8', 'load_in_4bit': True, 'load_in_8bit': False}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': False, 'do_predict': False, 'eval_strategy': 'no', 'prediction_loss_only': False, 'per_device_train_batch_size': 8, 'per_device_eval_batch_size': 8, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 
'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.0, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 2, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 0, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/May22_11-21-59_2c1b614ec68f', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': 4, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': True, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': False, 'metric_for_best_model': None, 'greater_is_better': None, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 
'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': True, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False}
+2024-05-22 11:24:05,601 INFO    MainThread:34 [wandb_config.py:__setitem__():151] config set model/num_parameters = 13033919160 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x78d6d5269d80>>
+2024-05-22 11:24:05,602 INFO    MainThread:34 [wandb_run.py:_config_callback():1347] config_cb model/num_parameters 13033919160 None
+2024-05-22 11:24:06,747 INFO    MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 11:24:06,748 INFO    MainThread:34 [wandb_init.py:_pause_backend():432] pausing backend

wandb/run-20240522_112259-4b714brj/run-4b714brj.wandb ADDED

Binary file (18.9 kB).

wandb/run-20240522_113413-8mudzhjp/files/conda-environment.yaml ADDED

File without changes

wandb/run-20240522_113413-8mudzhjp/files/config.yaml ADDED

	@@ -0,0 +1,754 @@

+wandb_version: 1
+_wandb:
+  desc: null
+  value:
+    python_version: 3.10.13
+    cli_version: 0.16.6
+    framework: huggingface
+    huggingface_version: 4.41.0
+    is_jupyter_run: true
+    is_kaggle_kernel: true
+    start_time: 1716377654.0
+    t:
+      1:
+      - 1
+      - 2
+      - 3
+      - 5
+      - 11
+      - 12
+      - 49
+      - 51
+      - 53
+      - 55
+      - 71
+      - 98
+      - 105
+      2:
+      - 1
+      - 2
+      - 3
+      - 5
+      - 11
+      - 12
+      - 49
+      - 51
+      - 53
+      - 55
+      - 71
+      - 98
+      - 105
+      3:
+      - 7
+      - 13
+      - 19
+      - 23
+      - 62
+      4: 3.10.13
+      5: 0.16.6
+      6: 4.41.0
+      8:
+      - 1
+      - 2
+      - 5
+      9:
+        1: transformers_trainer
+      13: linux-x86_64
+    m:
+    - 1: train/global_step
+      6:
+      - 3
+    - 1: train/loss
+      5: 1
+      6:
+      - 1
+    - 1: train/grad_norm
+      5: 1
+      6:
+      - 1
+    - 1: train/learning_rate
+      5: 1
+      6:
+      - 1
+    - 1: train/epoch
+      5: 1
+      6:
+      - 1
+peft_config:
+  desc: null
+  value:
+    default:
+      peft_type: LORA
+      auto_mapping: null
+      base_model_name_or_path: core42/jais-13b
+      revision: null
+      task_type: CAUSAL_LM
+      inference_mode: false
+      r: 16
+      target_modules:
+      - c_attn
+      lora_alpha: 32
+      lora_dropout: 0.05
+      fan_in_fan_out: false
+      bias: none
+      use_rslora: false
+      modules_to_save: null
+      init_lora_weights: true
+      layers_to_transform: null
+      layers_pattern: null
+      rank_pattern: {}
+      alpha_pattern: {}
+      megatron_config: null
+      megatron_core: megatron.core
+      loftq_config: {}
+      use_dora: false
+      layer_replication: null
+vocab_size:
+  desc: null
+  value: 84992
+n_positions:
+  desc: null
+  value: 2048
+n_embd:
+  desc: null
+  value: 5120
+n_layer:
+  desc: null
+  value: 40
+n_head:
+  desc: null
+  value: 40
+n_inner:
+  desc: null
+  value: 13653
+activation_function:
+  desc: null
+  value: swiglu
+resid_pdrop:
+  desc: null
+  value: 0.0
+embd_pdrop:
+  desc: null
+  value: 0.0
+attn_pdrop:
+  desc: null
+  value: 0.0
+layer_norm_epsilon:
+  desc: null
+  value: 1.0e-05
+initializer_range:
+  desc: null
+  value: 0.02
+scale_attn_weights:
+  desc: null
+  value: true
+use_cache:
+  desc: null
+  value: false
+scale_attn_by_inverse_layer_idx:
+  desc: null
+  value: false
+reorder_and_upcast_attn:
+  desc: null
+  value: false
+bos_token_id:
+  desc: null
+  value: 0
+eos_token_id:
+  desc: null
+  value: 0
+position_embedding_type:
+  desc: null
+  value: alibi
+width_scale:
+  desc: null
+  value: 0.11100000000000002
+embeddings_scale:
+  desc: null
+  value: 14.6
+scale_qk_dot_by_d:
+  desc: null
+  value: true
+return_dict:
+  desc: null
+  value: true
+output_hidden_states:
+  desc: null
+  value: false
+output_attentions:
+  desc: null
+  value: false
+torchscript:
+  desc: null
+  value: false
+torch_dtype:
+  desc: null
+  value: float32
+use_bfloat16:
+  desc: null
+  value: false
+tf_legacy_loss:
+  desc: null
+  value: false
+pruned_heads:
+  desc: null
+  value: {}
+tie_word_embeddings:
+  desc: null
+  value: true
+chunk_size_feed_forward:
+  desc: null
+  value: 0
+is_encoder_decoder:
+  desc: null
+  value: false
+is_decoder:
+  desc: null
+  value: false
+cross_attention_hidden_size:
+  desc: null
+  value: null
+add_cross_attention:
+  desc: null
+  value: false
+tie_encoder_decoder:
+  desc: null
+  value: false
+max_length:
+  desc: null
+  value: 20
+min_length:
+  desc: null
+  value: 0
+do_sample:
+  desc: null
+  value: false
+early_stopping:
+  desc: null
+  value: false
+num_beams:
+  desc: null
+  value: 1
+num_beam_groups:
+  desc: null
+  value: 1
+diversity_penalty:
+  desc: null
+  value: 0.0
+temperature:
+  desc: null
+  value: 1.0
+top_k:
+  desc: null
+  value: 50
+top_p:
+  desc: null
+  value: 1.0
+typical_p:
+  desc: null
+  value: 1.0
+repetition_penalty:
+  desc: null
+  value: 1.0
+length_penalty:
+  desc: null
+  value: 1.0
+no_repeat_ngram_size:
+  desc: null
+  value: 0
+encoder_no_repeat_ngram_size:
+  desc: null
+  value: 0
+bad_words_ids:
+  desc: null
+  value: null
+num_return_sequences:
+  desc: null
+  value: 1
+output_scores:
+  desc: null
+  value: false
+return_dict_in_generate:
+  desc: null
+  value: false
+forced_bos_token_id:
+  desc: null
+  value: null
+forced_eos_token_id:
+  desc: null
+  value: null
+remove_invalid_values:
+  desc: null
+  value: false
+exponential_decay_length_penalty:
+  desc: null
+  value: null
+suppress_tokens:
+  desc: null
+  value: null
+begin_suppress_tokens:
+  desc: null
+  value: null
+architectures:
+  desc: null
+  value:
+  - JAISLMHeadModel
+finetuning_task:
+  desc: null
+  value: null
+id2label:
+  desc: null
+  value:
+    '0': LABEL_0
+    '1': LABEL_1
+label2id:
+  desc: null
+  value:
+    LABEL_0: 0
+    LABEL_1: 1
+tokenizer_class:
+  desc: null
+  value: null
+prefix:
+  desc: null
+  value: null
+pad_token_id:
+  desc: null
+  value: 0
+sep_token_id:
+  desc: null
+  value: null
+decoder_start_token_id:
+  desc: null
+  value: null
+task_specific_params:
+  desc: null
+  value: null
+problem_type:
+  desc: null
+  value: null
+_name_or_path:
+  desc: null
+  value: core42/jais-13b
+transformers_version:
+  desc: null
+  value: 4.41.0
+auto_map:
+  desc: null
+  value:
+    AutoConfig: core42/jais-13b--configuration_jais.JAISConfig
+    AutoModel: core42/jais-13b--modeling_jais.JAISModel
+    AutoModelForCausalLM: core42/jais-13b--modeling_jais.JAISLMHeadModel
+    AutoModelForQuestionAnswering: core42/jais-13b--modeling_jais.JAISForQuestionAnswering
+    AutoModelForSequenceClassification: core42/jais-13b--modeling_jais.JAISForSequenceClassification
+    AutoModelForTokenClassification: core42/jais-13b--modeling_jais.JAISForTokenClassification
+model_type:
+  desc: null
+  value: jais
+quantization_config:
+  desc: null
+  value:
+    quant_method: QuantizationMethod.BITS_AND_BYTES
+    _load_in_8bit: false
+    _load_in_4bit: true
+    llm_int8_threshold: 6.0
+    llm_int8_skip_modules: null
+    llm_int8_enable_fp32_cpu_offload: false
+    llm_int8_has_fp16_weight: false
+    bnb_4bit_quant_type: nf4
+    bnb_4bit_use_double_quant: false
+    bnb_4bit_compute_dtype: bfloat16
+    bnb_4bit_quant_storage: uint8
+    load_in_4bit: true
+    load_in_8bit: false
+output_dir:
+  desc: null
+  value: /kaggle/working/
+overwrite_output_dir:
+  desc: null
+  value: false
+do_train:
+  desc: null
+  value: false
+do_eval:
+  desc: null
+  value: false
+do_predict:
+  desc: null
+  value: false
+eval_strategy:
+  desc: null
+  value: 'no'
+prediction_loss_only:
+  desc: null
+  value: false
+per_device_train_batch_size:
+  desc: null
+  value: 8
+per_device_eval_batch_size:
+  desc: null
+  value: 8
+per_gpu_train_batch_size:
+  desc: null
+  value: null
+per_gpu_eval_batch_size:
+  desc: null
+  value: null
+gradient_accumulation_steps:
+  desc: null
+  value: 1
+eval_accumulation_steps:
+  desc: null
+  value: null
+eval_delay:
+  desc: null
+  value: 0
+learning_rate:
+  desc: null
+  value: 0.0002
+weight_decay:
+  desc: null
+  value: 0.0
+adam_beta1:
+  desc: null
+  value: 0.9
+adam_beta2:
+  desc: null
+  value: 0.999
+adam_epsilon:
+  desc: null
+  value: 1.0e-08
+max_grad_norm:
+  desc: null
+  value: 1.0
+num_train_epochs:
+  desc: null
+  value: 2
+max_steps:
+  desc: null
+  value: -1
+lr_scheduler_type:
+  desc: null
+  value: linear
+lr_scheduler_kwargs:
+  desc: null
+  value: {}
+warmup_ratio:
+  desc: null
+  value: 0.0
+warmup_steps:
+  desc: null
+  value: 0
+log_level:
+  desc: null
+  value: passive
+log_level_replica:
+  desc: null
+  value: warning
+log_on_each_node:
+  desc: null
+  value: true
+logging_dir:
+  desc: null
+  value: /kaggle/working/runs/May22_11-33-56_2c1b614ec68f
+logging_strategy:
+  desc: null
+  value: steps
+logging_first_step:
+  desc: null
+  value: false
+logging_steps:
+  desc: null
+  value: 10
+logging_nan_inf_filter:
+  desc: null
+  value: true
+save_strategy:
+  desc: null
+  value: epoch
+save_steps:
+  desc: null
+  value: 500
+save_total_limit:
+  desc: null
+  value: 4
+save_safetensors:
+  desc: null
+  value: true
+save_on_each_node:
+  desc: null
+  value: false
+save_only_model:
+  desc: null
+  value: false
+restore_callback_states_from_checkpoint:
+  desc: null
+  value: false
+no_cuda:
+  desc: null
+  value: false
+use_cpu:
+  desc: null
+  value: false
+use_mps_device:
+  desc: null
+  value: false
+seed:
+  desc: null
+  value: 42
+data_seed:
+  desc: null
+  value: null
+jit_mode_eval:
+  desc: null
+  value: false
+use_ipex:
+  desc: null
+  value: false
+bf16:
+  desc: null
+  value: true
+fp16:
+  desc: null
+  value: false
+fp16_opt_level:
+  desc: null
+  value: O1
+half_precision_backend:
+  desc: null
+  value: auto
+bf16_full_eval:
+  desc: null
+  value: false
+fp16_full_eval:
+  desc: null
+  value: false
+tf32:
+  desc: null
+  value: null
+local_rank:
+  desc: null
+  value: 0
+ddp_backend:
+  desc: null
+  value: null
+tpu_num_cores:
+  desc: null
+  value: null
+tpu_metrics_debug:
+  desc: null
+  value: false
+debug:
+  desc: null
+  value: []
+dataloader_drop_last:
+  desc: null
+  value: false
+eval_steps:
+  desc: null
+  value: null
+dataloader_num_workers:
+  desc: null
+  value: 0
+dataloader_prefetch_factor:
+  desc: null
+  value: null
+past_index:
+  desc: null
+  value: -1
+run_name:
+  desc: null
+  value: /kaggle/working/
+disable_tqdm:
+  desc: null
+  value: false
+remove_unused_columns:
+  desc: null
+  value: true
+label_names:
+  desc: null
+  value: null
+load_best_model_at_end:
+  desc: null
+  value: false
+metric_for_best_model:
+  desc: null
+  value: null
+greater_is_better:
+  desc: null
+  value: null
+ignore_data_skip:
+  desc: null
+  value: false
+fsdp:
+  desc: null
+  value: []
+fsdp_min_num_params:
+  desc: null
+  value: 0
+fsdp_config:
+  desc: null
+  value:
+    min_num_params: 0
+    xla: false
+    xla_fsdp_v2: false
+    xla_fsdp_grad_ckpt: false
+fsdp_transformer_layer_cls_to_wrap:
+  desc: null
+  value: null
+accelerator_config:
+  desc: null
+  value:
+    split_batches: false
+    dispatch_batches: null
+    even_batches: true
+    use_seedable_sampler: true
+    non_blocking: false
+    gradient_accumulation_kwargs: null
+deepspeed:
+  desc: null
+  value: null
+label_smoothing_factor:
+  desc: null
+  value: 0.0
+optim:
+  desc: null
+  value: adamw_torch
+optim_args:
+  desc: null
+  value: null
+adafactor:
+  desc: null
+  value: false
+group_by_length:
+  desc: null
+  value: false
+length_column_name:
+  desc: null
+  value: length
+report_to:
+  desc: null
+  value:
+  - tensorboard
+  - wandb
+ddp_find_unused_parameters:
+  desc: null
+  value: null
+ddp_bucket_cap_mb:
+  desc: null
+  value: null
+ddp_broadcast_buffers:
+  desc: null
+  value: null
+dataloader_pin_memory:
+  desc: null
+  value: true
+dataloader_persistent_workers:
+  desc: null
+  value: false
+skip_memory_metrics:
+  desc: null
+  value: true
+use_legacy_prediction_loop:
+  desc: null
+  value: false
+push_to_hub:
+  desc: null
+  value: false
+resume_from_checkpoint:
+  desc: null
+  value: null
+hub_model_id:
+  desc: null
+  value: null
+hub_strategy:
+  desc: null
+  value: every_save
+hub_token:
+  desc: null
+  value: <HUB_TOKEN>
+hub_private_repo:
+  desc: null
+  value: false
+hub_always_push:
+  desc: null
+  value: false
+gradient_checkpointing:
+  desc: null
+  value: false
+gradient_checkpointing_kwargs:
+  desc: null
+  value: null
+include_inputs_for_metrics:
+  desc: null
+  value: false
+eval_do_concat_batches:
+  desc: null
+  value: true
+fp16_backend:
+  desc: null
+  value: auto
+evaluation_strategy:
+  desc: null
+  value: null
+push_to_hub_model_id:
+  desc: null
+  value: null
+push_to_hub_organization:
+  desc: null
+  value: null
+push_to_hub_token:
+  desc: null
+  value: <PUSH_TO_HUB_TOKEN>
+mp_parameters:
+  desc: null
+  value: ''
+auto_find_batch_size:
+  desc: null
+  value: true
+full_determinism:
+  desc: null
+  value: false
+torchdynamo:
+  desc: null
+  value: null
+ray_scope:
+  desc: null
+  value: last
+ddp_timeout:
+  desc: null
+  value: 1800
+torch_compile:
+  desc: null
+  value: false
+torch_compile_backend:
+  desc: null
+  value: null
+torch_compile_mode:
+  desc: null
+  value: null
+dispatch_batches:
+  desc: null
+  value: null
+split_batches:
+  desc: null
+  value: null
+include_tokens_per_second:
+  desc: null
+  value: false
+include_num_input_tokens_seen:
+  desc: null
+  value: false
+neftune_noise_alpha:
+  desc: null
+  value: null
+optim_target_modules:
+  desc: null
+  value: null
+batch_eval_metrics:
+  desc: null
+  value: false
+model/num_parameters:
+  desc: null
+  value: 13033919160
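
The config.yaml above wraps every setting in a `desc`/`value` pair, which makes it awkward to read back. The following is a minimal sketch, not part of the commit, of how such a file could be flattened once it has been loaded into a dict (for example with PyYAML). The function name `flatten_wandb_config` and the hand-copied `sample` excerpt are hypothetical. The last lines compute the LoRA scaling factor `lora_alpha / r`, which applies here because `use_rslora` is false.

```python
from typing import Any, Dict


def flatten_wandb_config(raw: Dict[str, Any]) -> Dict[str, Any]:
    """Strip the desc/value wrapper that W&B puts around each config entry.

    `raw` is assumed to be the already-parsed content of a W&B config.yaml.
    """
    flat: Dict[str, Any] = {}
    for key, entry in raw.items():
        # Skip W&B bookkeeping entries; they are not training settings.
        if key in ("wandb_version", "_wandb"):
            continue
        if isinstance(entry, dict) and "value" in entry:
            flat[key] = entry["value"]
        else:
            flat[key] = entry
    return flat


# Hand-copied excerpt of the config shown above (hypothetical sample input).
sample = {
    "wandb_version": 1,
    "_wandb": {"desc": None, "value": {"python_version": "3.10.13"}},
    "learning_rate": {"desc": None, "value": 0.0002},
    "num_train_epochs": {"desc": None, "value": 2},
    "peft_config": {
        "desc": None,
        "value": {
            "default": {"r": 16, "lora_alpha": 32, "target_modules": ["c_attn"]}
        },
    },
}

config = flatten_wandb_config(sample)
lora = config["peft_config"]["default"]

# With use_rslora disabled, the LoRA update is scaled by lora_alpha / r.
lora_scale = lora["lora_alpha"] / lora["r"]
```

With the values recorded in this run (`r: 16`, `lora_alpha: 32`), `lora_scale` evaluates to 2.0.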

wandb/run-20240522_113413-8mudzhjp/files/output.log ADDED

	@@ -0,0 +1,93 @@

+/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:464: UserWarning: torch.utils.checkpoint: the use_reentrant parameter should be passed explicitly. In version 2.4 we will raise an exception if use_reentrant is not passed. use_reentrant=False is recommended, but if you need to preserve the current default behavior, you can pass use_reentrant=True. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:464: UserWarning: torch.utils.checkpoint: the use_reentrant parameter should be passed explicitly. In version 2.4 we will raise an exception if use_reentrant is not passed. use_reentrant=False is recommended, but if you need to preserve the current default behavior, you can pass use_reentrant=True. Refer to docs for more details on the differences between the two variants.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
+  warnings.warn(
+huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks...
+To disable this warning, you can either:
+	- Avoid using `tokenizers` before the fork if possible
+	- Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false)
+usage: huggingface-cli <command> [<args>]
+positional arguments:
+  {env,login,whoami,logout,repo,upload,download,lfs-enable-largefiles,lfs-multipart-upload,scan-cache,delete-cache,tag}
+                        huggingface-cli command helpers
+    env                 Print information about the environment.
+    login               Log in using a token from
+                        huggingface.co/settings/tokens
+    whoami              Find out which huggingface.co account you are logged
+                        in as.
+    logout              Log out
+    repo                {create} Commands to interact with your huggingface.co
+                        repos.
+    upload              Upload a file or a folder to a repo on the Hub
+    download            Download files from the Hub
+    lfs-enable-largefiles
+                        Configure your repository to enable upload of files >
+                        5GB.
+    scan-cache          Scan cache directory.
+    delete-cache        Delete revisions from the cache directory.
+    tag                 (create, list, delete) tags for a repo in the hub
+options:
+  -h, --help            show this help message and exit
+huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks...
+To disable this warning, you can either:
+	- Avoid using `tokenizers` before the fork if possible
+	- Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false)
+    _|    _|  _|    _|    _|_|_|    _|_|_|  _|_|_|  _|      _|    _|_|_|      _|_|_|_|    _|_|      _|_|_|  _|_|_|_|
+    _|    _|  _|    _|  _|        _|          _|    _|_|    _|  _|            _|        _|    _|  _|        _|
+    _|_|_|_|  _|    _|  _|  _|_|  _|  _|_|    _|    _|  _|  _|  _|  _|_|      _|_|_|    _|_|_|_|  _|        _|_|_|
+    _|    _|  _|    _|  _|    _|  _|    _|    _|    _|    _|_|  _|    _|      _|        _|    _|  _|        _|
+    _|    _|    _|_|      _|_|_|    _|_|_|  _|_|_|  _|      _|    _|_|_|      _|        _|    _|    _|_|_|  _|_|_|_|
+    To login, `huggingface_hub` requires a token generated from https://huggingface.co/settings/tokens .
+Enter your token (input will not be visible): Traceback (most recent call last):
+  File "/opt/conda/bin/huggingface-cli", line 8, in <module>
+    sys.exit(main())
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/commands/huggingface_cli.py", line 51, in main
+    service.run()
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/commands/user.py", line 98, in run
+    login(token=self.args.token, add_to_git_credential=self.args.add_to_git_credential)
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_login.py", line 115, in login
+    interpreter_login(new_session=new_session, write_permission=write_permission)
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_login.py", line 191, in interpreter_login
+    token = getpass("Enter your token (input will not be visible): ")
+  File "/opt/conda/lib/python3.10/getpass.py", line 77, in unix_getpass
+    passwd = _raw_input(prompt, stream, input=input)
+  File "/opt/conda/lib/python3.10/getpass.py", line 146, in _raw_input
+    line = input.readline()
+  File "/opt/conda/lib/python3.10/codecs.py", line 319, in decode
+    def decode(self, input, final=False):
+KeyboardInterrupt
+huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks...
+To disable this warning, you can either:
+	- Avoid using `tokenizers` before the fork if possible
+	- Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false)
+    _|    _|  _|    _|    _|_|_|    _|_|_|  _|_|_|  _|      _|    _|_|_|      _|_|_|_|    _|_|      _|_|_|  _|_|_|_|
+    _|    _|  _|    _|  _|        _|          _|    _|_|    _|  _|            _|        _|    _|  _|        _|
+    _|_|_|_|  _|    _|  _|  _|_|  _|  _|_|    _|    _|  _|  _|  _|  _|_|      _|_|_|    _|_|_|_|  _|        _|_|_|
+    _|    _|  _|    _|  _|    _|  _|    _|    _|    _|    _|_|  _|    _|      _|        _|    _|  _|        _|
+    _|    _|    _|_|      _|_|_|    _|_|_|  _|_|_|  _|      _|    _|_|_|      _|        _|    _|    _|_|_|  _|_|_|_|
+    To login, `huggingface_hub` requires a token generated from https://huggingface.co/settings/tokens .
+Enter your token (input will not be visible): Traceback (most recent call last):
+  File "/opt/conda/bin/huggingface-cli", line 8, in <module>
+    sys.exit(main())
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/commands/huggingface_cli.py", line 51, in main
+    service.run()
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/commands/user.py", line 98, in run
+    login(token=self.args.token, add_to_git_credential=self.args.add_to_git_credential)
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_login.py", line 115, in login
+    interpreter_login(new_session=new_session, write_permission=write_permission)
+  File "/opt/conda/lib/python3.10/site-packages/huggingface_hub/_login.py", line 191, in interpreter_login
+    token = getpass("Enter your token (input will not be visible): ")
+  File "/opt/conda/lib/python3.10/getpass.py", line 77, in unix_getpass
+    passwd = _raw_input(prompt, stream, input=input)
+  File "/opt/conda/lib/python3.10/getpass.py", line 146, in _raw_input
+    line = input.readline()
+  File "/opt/conda/lib/python3.10/codecs.py", line 319, in decode
+    def decode(self, input, final=False):
+KeyboardInterrupt
+/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
+  warnings.warn(
+/opt/conda/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
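
The output.log above shows two issues that are common in notebook runs: the `TOKENIZERS_PARALLELISM` fork warning, and an interactive `huggingface-cli login` prompt that ended in `KeyboardInterrupt`. The following is a hedged sketch, not taken from the commit, of one way to avoid both by setting the environment variable explicitly and logging in with a token read from the environment. The helper name `login_from_env` and the choice of `HF_TOKEN` as the variable name are assumptions for illustration; `huggingface_hub.login(token=...)` is the library call used.

```python
import os

# The warning in the log suggests setting this explicitly; doing so early,
# before any tokenizer is used, avoids the fork warning.
os.environ["TOKENIZERS_PARALLELISM"] = "false"


def login_from_env(var_name: str = "HF_TOKEN") -> bool:
    """Log in to the Hub without an interactive prompt.

    Returns False (and does nothing) when the variable is not set, so the
    notebook never blocks waiting for keyboard input.
    `var_name` is an assumed convention, not something the log prescribes.
    """
    token = os.environ.get(var_name)
    if not token:
        return False
    # Imported lazily so the helper can be defined even where the
    # package is not installed.
    from huggingface_hub import login

    login(token=token)
    return True
```

On Kaggle, the token could be placed in the environment from a notebook secret before calling `login_from_env()`, which keeps it out of the notebook source and the logs.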

wandb/run-20240522_113413-8mudzhjp/files/requirements.txt ADDED

	@@ -0,0 +1,878 @@

+Babel==2.14.0
+Boruta==0.3
+Brotli==1.0.9
+CVXcanon==0.1.2
+Cartopy==0.23.0
+Cython==3.0.8
+Deprecated==1.2.14
+Farama-Notifications==0.0.4
+Flask==3.0.3
+Geohash==1.0
+GitPython==3.1.41
+ImageHash==4.3.1
+Janome==0.5.0
+Jinja2==3.1.2
+LunarCalendar==0.0.9
+Mako==1.3.3
+Markdown==3.5.2
+MarkupSafe==2.1.3
+MarkupSafe==2.1.5
+Pillow==9.5.0
+PuLP==2.8.0
+PyArabic==0.6.15
+PyJWT==2.8.0
+PyMeeus==0.5.12
+PySocks==1.7.1
+PyUpSet==0.1.1.post7
+PyWavelets==1.5.0
+PyYAML==6.0.1
+Pygments==2.17.2
+Pympler==1.0.1
+QtPy==2.4.1
+Rtree==1.2.0
+SQLAlchemy==2.0.25
+SecretStorage==3.3.3
+Send2Trash==1.8.2
+Shapely==1.8.5.post1
+Shimmy==1.3.0
+SimpleITK==2.3.1
+TPOT==0.12.1
+Theano-PyMC==1.1.2
+Theano==1.0.5
+Wand==0.6.13
+Werkzeug==3.0.2
+absl-py==1.4.0
+accelerate==0.30.1
+access==1.1.9
+affine==2.4.0
+aiobotocore==2.12.3
+aiofiles==22.1.0
+aiohttp-cors==0.7.0
+aiohttp==3.9.1
+aioitertools==0.11.0
+aiorwlock==1.3.0
+aiosignal==1.3.1
+aiosqlite==0.19.0
+albumentations==1.4.0
+alembic==1.13.1
+altair==5.3.0
+annotated-types==0.6.0
+annoy==1.17.3
+anyio==4.2.0
+apache-beam==2.46.0
+aplus==0.11.0
+appdirs==1.4.4
+archspec==0.2.3
+argon2-cffi-bindings==21.2.0
+argon2-cffi==23.1.0
+array-record==0.5.0
+arrow==1.3.0
+arviz==0.18.0
+astroid==3.1.0
+astropy-iers-data==0.2024.4.15.2.45.49
+astropy==6.0.1
+asttokens==2.4.1
+astunparse==1.6.3
+async-lru==2.0.4
+async-timeout==4.0.3
+attrs==23.2.0
+audioread==3.0.1
+autopep8==2.0.4
+backoff==2.2.1
+bayesian-optimization==1.4.3
+beatrix_jupyterlab==2023.128.151533
+beautifulsoup4==4.12.2
+bitsandbytes==0.43.1
+blake3==0.2.1
+bleach==6.1.0
+blessed==1.20.0
+blinker==1.7.0
+blis==0.7.10
+blosc2==2.6.2
+bokeh==3.4.1
+boltons==23.1.1
+boto3==1.26.100
+botocore==1.34.69
+bq_helper==0.4.1
+bqplot==0.12.43
+branca==0.7.1
+brewer2mpl==1.4.1
+brotlipy==0.7.0
+cached-property==1.5.2
+cachetools==4.2.4
+cachetools==5.3.2
+catalogue==2.0.10
+catalyst==22.4
+catboost==1.2.3
+category-encoders==2.6.3
+certifi==2024.2.2
+cesium==0.12.1
+cffi==1.16.0
+charset-normalizer==3.3.2
+chex==0.1.86
+cleverhans==4.0.0
+click-plugins==1.1.1
+click==8.1.7
+cligj==0.7.2
+cloud-tpu-client==0.10
+cloud-tpu-profiler==2.4.0
+cloudpathlib==0.16.0
+cloudpickle==2.2.1
+cloudpickle==3.0.0
+cmdstanpy==1.2.2
+colorama==0.4.6
+colorcet==3.1.0
+colorful==0.5.6
+colorlog==6.8.2
+colorlover==0.3.0
+comm==0.2.1
+conda-libmamba-solver==23.7.0
+conda-package-handling==2.2.0
+conda==23.7.4
+conda_package_streaming==0.9.0
+confection==0.1.4
+contextily==1.6.0
+contourpy==1.2.0
+contourpy==1.2.1
+convertdate==2.4.0
+crcmod==1.7
+cryptography==41.0.7
+cuda-python==12.4.0
+cudf==23.8.0
+cufflinks==0.17.3
+cuml==23.8.0
+cupy==13.0.0
+cycler==0.12.1
+cymem==2.0.8
+cytoolz==0.12.3
+daal4py==2024.3.0
+daal==2024.3.0
+dacite==1.8.1
+dask-cuda==23.8.0
+dask-cudf==23.8.0
+dask-expr==1.0.11
+dask==2024.4.1
+dataclasses-json==0.6.4
+dataproc_jupyter_plugin==0.1.66
+datasets==2.18.0
+datashader==0.16.0
+datatile==1.0.3
+db-dtypes==1.2.0
+deap==1.4.1
+debugpy==1.8.0
+decorator==5.1.1
+deepdiff==7.0.1
+defusedxml==0.7.1
+deprecation==2.1.0
+descartes==1.1.0
+dill==0.3.8
+dipy==1.9.0
+distlib==0.3.8
+distributed==2023.7.1
+distro==1.9.0
+dm-tree==0.1.8
+docker-pycreds==0.4.0
+docker==7.0.0
+docopt==0.6.2
+docstring-parser==0.15
+docstring-to-markdown==0.15
+docutils==0.21.1
+earthengine-api==0.1.399
+easydict==1.13
+easyocr==1.7.1
+ecos==2.0.13
+einops==0.8.0
+eli5==0.13.0
+emoji==2.11.0
+en-core-web-lg==3.7.1
+en-core-web-sm==3.7.1
+entrypoints==0.4
+ephem==4.1.5
+esda==2.5.1
+essentia==2.1b6.dev1110
+et-xmlfile==1.1.0
+etils==1.6.0
+exceptiongroup==1.2.0
+executing==2.0.1
+explainable-ai-sdk==1.3.3
+fastai==2.7.14
+fastapi==0.108.0
+fastavro==1.9.3
+fastcore==1.5.29
+fastdownload==0.0.7
+fasteners==0.19
+fastjsonschema==2.19.1
+fastprogress==1.0.3
+fastrlock==0.8.2
+fasttext==0.9.2
+feather-format==0.4.1
+featuretools==1.30.0
+filelock==3.13.1
+fiona==1.9.6
+fitter==1.7.0
+flake8==7.0.0
+flashtext==2.7
+flatbuffers==23.5.26
+flax==0.8.2
+folium==0.16.0
+fonttools==4.47.0
+fonttools==4.51.0
+fqdn==1.5.1
+frozendict==2.4.2
+frozenlist==1.4.1
+fsspec==2024.2.0
+fsspec==2024.3.1
+funcy==2.0
+fury==0.10.0
+future==1.0.0
+fuzzywuzzy==0.18.0
+gast==0.5.4
+gatspy==0.3
+gcsfs==2024.2.0
+gensim==4.3.2
+geographiclib==2.0
+geojson==3.1.0
+geopandas==0.14.3
+geoplot==0.5.1
+geopy==2.4.1
+geoviews==1.12.0
+ggplot==0.11.5
+giddy==2.3.5
+gitdb==4.0.11
+google-ai-generativelanguage==0.6.2
+google-api-core==2.11.1
+google-api-core==2.18.0
+google-api-python-client==2.126.0
+google-apitools==0.5.31
+google-auth-httplib2==0.2.0
+google-auth-oauthlib==1.2.0
+google-auth==2.26.1
+google-cloud-aiplatform==0.6.0a1
+google-cloud-artifact-registry==1.10.0
+google-cloud-automl==1.0.1
+google-cloud-bigquery==2.34.4
+google-cloud-bigtable==1.7.3
+google-cloud-core==2.4.1
+google-cloud-datastore==2.19.0
+google-cloud-dlp==3.14.0
+google-cloud-jupyter-config==0.0.5
+google-cloud-language==2.13.3
+google-cloud-monitoring==2.18.0
+google-cloud-pubsub==2.19.0
+google-cloud-pubsublite==1.9.0
+google-cloud-recommendations-ai==0.7.1
+google-cloud-resource-manager==1.11.0
+google-cloud-spanner==3.40.1
+google-cloud-storage==1.44.0
+google-cloud-translate==3.12.1
+google-cloud-videointelligence==2.13.3
+google-cloud-vision==2.8.0
+google-crc32c==1.5.0
+google-generativeai==0.5.1
+google-pasta==0.2.0
+google-resumable-media==2.7.0
+googleapis-common-protos==1.62.0
+gplearn==0.4.2
+gpustat==1.0.0
+gpxpy==1.6.2
+graphviz==0.20.3
+greenlet==3.0.3
+grpc-google-iam-v1==0.12.7
+grpcio-status==1.48.1
+grpcio-status==1.48.2
+grpcio==1.51.1
+grpcio==1.60.0
+gviz-api==1.10.0
+gym-notices==0.0.8
+gym==0.26.2
+gymnasium==0.29.0
+h11==0.14.0
+h2o==3.46.0.1
+h5netcdf==1.3.0
+h5py==3.10.0
+haversine==2.8.1
+hdfs==2.7.3
+hep-ml==0.7.2
+hijri-converter==2.3.1
+hmmlearn==0.3.2
+holidays==0.24
+holoviews==1.18.3
+hpsklearn==0.1.0
+html5lib==1.1
+htmlmin==0.1.12
+httpcore==1.0.5
+httplib2==0.21.0
+httptools==0.6.1
+httpx==0.27.0
+huggingface-hub==0.23.1
+hunspell==0.5.5
+hydra-slayer==0.5.0
+hyperopt==0.2.7
+hypertools==0.8.0
+idna==3.6
+igraph==0.11.4
+imagecodecs==2024.1.1
+imageio==2.33.1
+imbalanced-learn==0.12.2
+imgaug==0.4.0
+importlib-metadata==6.11.0
+importlib-metadata==7.0.1
+importlib-resources==6.1.1
+inequality==1.0.1
+iniconfig==2.0.0
+ipydatawidgets==4.3.5
+ipykernel==6.28.0
+ipyleaflet==0.18.2
+ipympl==0.7.0
+ipython-genutils==0.2.0
+ipython-genutils==0.2.0
+ipython-sql==0.5.0
+ipython==8.20.0
+ipyvolume==0.6.3
+ipyvue==1.11.0
+ipyvuetify==1.9.4
+ipywebrtc==0.6.0
+ipywidgets==7.7.1
+isoduration==20.11.0
+isort==5.13.2
+isoweek==1.3.3
+itsdangerous==2.2.0
+jaraco.classes==3.3.0
+jax-jumpy==1.0.0
+jax==0.4.23
+jaxlib==0.4.23.dev20240116
+jedi==0.19.1
+jeepney==0.8.0
+jieba==0.42.1
+jmespath==1.0.1
+joblib==1.4.0
+json5==0.9.14
+jsonpatch==1.33
+jsonpointer==2.4
+jsonschema-specifications==2023.12.1
+jsonschema==4.20.0
+jupyter-console==6.6.3
+jupyter-events==0.9.0
+jupyter-http-over-ws==0.0.8
+jupyter-lsp==1.5.1
+jupyter-server-mathjax==0.2.6
+jupyter-ydoc==0.2.5
+jupyter_client==7.4.9
+jupyter_client==8.6.0
+jupyter_core==5.7.1
+jupyter_server==2.12.5
+jupyter_server_fileid==0.9.1
+jupyter_server_proxy==4.1.0
+jupyter_server_terminals==0.5.1
+jupyter_server_ydoc==0.8.0
+jupyterlab-lsp==5.1.0
+jupyterlab-widgets==3.0.9
+jupyterlab==4.1.6
+jupyterlab_git==0.44.0
+jupyterlab_pygments==0.3.0
+jupyterlab_server==2.25.2
+jupytext==1.16.0
+kaggle-environments==1.14.3
+kaggle==1.6.12
+kagglehub==0.2.3
+keras-cv==0.8.2
+keras-nlp==0.9.3
+keras-tuner==1.4.6
+keras==3.2.1
+kernels-mixer==0.0.7
+keyring==24.3.0
+keyrings.google-artifactregistry-auth==1.1.2
+kfp-pipeline-spec==0.2.2
+kfp-server-api==2.0.5
+kfp==2.5.0
+kiwisolver==1.4.5
+kmapper==2.0.1
+kmodes==0.12.2
+korean-lunar-calendar==0.3.1
+kornia==0.7.2
+kornia_rs==0.1.3
+kt-legacy==1.0.5
+kubernetes==26.1.0
+langcodes==3.3.0
+langid==1.1.6
+lazy_loader==0.3
+learntools==0.3.4
+leven==1.0.4
+libclang==16.0.6
+libmambapy==1.5.0
+libpysal==4.9.2
+librosa==0.10.1
+lightgbm==4.2.0
+lightning-utilities==0.11.2
+lime==0.2.0.1
+line-profiler==4.1.2
+linkify-it-py==2.0.3
+llvmlite==0.41.1
+llvmlite==0.42.0
+lml==0.1.0
+locket==1.0.0
+loguru==0.7.2
+loralib==0.1.2
+lxml==5.2.1
+lz4==4.3.3
+mamba==1.5.0
+mapclassify==2.6.1
+markdown-it-py==3.0.0
+marshmallow==3.21.1
+matplotlib-inline==0.1.6
+matplotlib-venn==0.11.10
+matplotlib==3.7.5
+matplotlib==3.8.4
+mccabe==0.7.0
+mdit-py-plugins==0.4.0
+mdurl==0.1.2
+memory-profiler==0.61.0
+menuinst==2.0.1
+mercantile==1.2.1
+mgwr==2.2.1
+missingno==0.5.2
+mistune==0.8.4
+mizani==0.11.1
+ml-dtypes==0.2.0
+mlcrate==0.2.0
+mlens==0.2.3
+mlxtend==0.23.1
+mne==1.6.1
+mnist==0.2.2
+momepy==0.7.0
+more-itertools==10.2.0
+mpld3==0.5.10
+mpmath==1.3.0
+msgpack==1.0.7
+multidict==6.0.4
+multimethod==1.10
+multipledispatch==1.0.0
+multiprocess==0.70.16
+munkres==1.1.4
+murmurhash==1.0.10
+mypy-extensions==1.0.0
+namex==0.0.8
+nb-conda-kernels==2.3.1
+nb_conda==2.2.1
+nbclassic==1.0.0
+nbclient==0.5.13
+nbconvert==6.4.5
+nbdime==3.2.0
+nbformat==5.9.2
+ndindex==1.8
+nest-asyncio==1.5.8
+networkx==3.2.1
+nibabel==5.2.1
+nilearn==0.10.4
+ninja==1.11.1.1
+nltk==3.2.4
+nose==1.3.7
+notebook==6.5.4
+notebook==6.5.6
+notebook_executor==0.2
+notebook_shim==0.2.3
+numba==0.58.1
+numba==0.59.1
+numexpr==2.10.0
+numpy==1.26.4
+nvidia-cublas-cu12==12.1.3.1
+nvidia-cuda-cupti-cu12==12.1.105
+nvidia-cuda-nvrtc-cu12==12.1.105
+nvidia-cuda-runtime-cu12==12.1.105
+nvidia-cudnn-cu12==8.9.2.26
+nvidia-cufft-cu12==11.0.2.54
+nvidia-curand-cu12==10.3.2.106
+nvidia-cusolver-cu12==11.4.5.107
+nvidia-cusparse-cu12==12.1.0.106
+nvidia-ml-py==11.495.46
+nvidia-nccl-cu12==2.20.5
+nvidia-nvjitlink-cu12==12.5.40
+nvidia-nvtx-cu12==12.1.105
+nvtx==0.2.10
+oauth2client==4.1.3
+oauthlib==3.2.2
+objsize==0.6.1
+odfpy==1.4.1
+olefile==0.47
+onnx==1.16.0
+opencensus-context==0.1.3
+opencensus==0.11.4
+opencv-contrib-python==4.9.0.80
+opencv-python-headless==4.9.0.80
+opencv-python==4.9.0.80
+openpyxl==3.1.2
+openslide-python==1.3.1
+opentelemetry-api==1.22.0
+opentelemetry-exporter-otlp-proto-common==1.22.0
+opentelemetry-exporter-otlp-proto-grpc==1.22.0
+opentelemetry-exporter-otlp-proto-http==1.22.0
+opentelemetry-exporter-otlp==1.22.0
+opentelemetry-proto==1.22.0
+opentelemetry-sdk==1.22.0
+opentelemetry-semantic-conventions==0.43b0
+opt-einsum==3.3.0
+optax==0.2.2
+optree==0.11.0
+optuna==3.6.1
+orbax-checkpoint==0.5.9
+ordered-set==4.1.0
+orjson==3.9.10
+ortools==9.4.1874
+osmnx==1.9.2
+overrides==7.4.0
+packaging==21.3
+pandas-datareader==0.10.0
+pandas-profiling==3.6.6
+pandas-summary==0.2.0
+pandas==2.1.4
+pandas==2.2.2
+pandasql==0.7.3
+pandocfilters==1.5.0
+panel==1.4.1
+papermill==2.5.0
+param==2.1.0
+parso==0.8.3
+partd==1.4.1
+path.py==12.5.0
+path==16.14.0
+pathos==0.3.2
+pathy==0.10.3
+patsy==0.5.6
+pdf2image==1.17.0
+peft==0.11.1
+pettingzoo==1.24.0
+pexpect==4.8.0
+pexpect==4.9.0
+phik==0.12.4
+pickleshare==0.7.5
+pillow==10.3.0
+pip==23.3.2
+pkgutil_resolve_name==1.3.10
+platformdirs==4.2.0
+plotly-express==0.4.1
+plotly==5.18.0
+plotnine==0.13.4
+pluggy==1.4.0
+pointpats==2.4.0
+polars==0.20.21
+polyglot==16.7.4
+pooch==1.8.1
+pox==0.3.4
+ppca==0.0.4
+ppft==1.7.6.8
+preprocessing==0.1.13
+preshed==3.0.9
+prettytable==3.9.0
+progressbar2==4.4.2
+prometheus-client==0.19.0
+promise==2.3
+prompt-toolkit==3.0.42
+prompt-toolkit==3.0.43
+prophet==1.1.1
+proto-plus==1.23.0
+protobuf==3.20.3
+protobuf==4.21.12
+psutil==5.9.3
+psutil==5.9.7
+ptyprocess==0.7.0
+pudb==2024.1
+pure-eval==0.2.2
+py-cpuinfo==9.0.0
+py-spy==0.3.14
+py4j==0.10.9.7
+pyLDAvis==3.4.1
+pyOpenSSL==23.3.0
+pyaml==23.12.0
+pyarrow-hotfix==0.6
+pyarrow==15.0.2
+pyasn1-modules==0.3.0
+pyasn1==0.5.1
+pybind11==2.12.0
+pyclipper==1.3.0.post5
+pycodestyle==2.11.1
+pycosat==0.6.6
+pycparser==2.21
+pycryptodome==3.20.0
+pyct==0.5.0
+pycuda==2024.1
+pydantic==2.5.3
+pydantic==2.7.0
+pydantic_core==2.14.6
+pydantic_core==2.18.1
+pydegensac==0.1.2
+pydicom==2.4.4
+pydocstyle==6.3.0
+pydot==1.4.2
+pydub==0.25.1
+pyemd==1.0.0
+pyerfa==2.0.1.4
+pyexcel-io==0.6.6
+pyexcel-ods==0.6.0
+pyflakes==3.2.0
+pygltflib==1.16.2
+pykalman==0.9.7
+pylibraft==23.8.0
+pylint==3.1.0
+pymc3==3.11.4
+pymongo==3.13.0
+pynndescent==0.5.12
+pynvml==11.4.1
+pynvrtc==9.2
+pyparsing==3.1.1
+pyparsing==3.1.2
+pypdf==4.2.0
+pyproj==3.6.1
+pysal==24.1
+pyshp==2.3.1
+pytesseract==0.3.10
+pytest==8.1.1
+python-bidi==0.4.2
+python-dateutil==2.9.0.post0
+python-dotenv==1.0.0
+python-json-logger==2.0.7
+python-louvain==0.16
+python-lsp-jsonrpc==1.1.2
+python-lsp-server==1.11.0
+python-slugify==8.0.4
+python-utils==3.8.2
+pythreejs==2.4.2
+pytoolconfig==1.3.1
+pytools==2024.1.1
+pytorch-ignite==0.5.0.post2
+pytorch-lightning==2.2.2
+pytz==2023.3.post1
+pytz==2024.1
+pyu2f==0.1.5
+pyviz_comms==3.0.2
+pyzmq==24.0.1
+pyzmq==25.1.2
+qgrid==1.3.1
+qtconsole==5.5.1
+quantecon==0.7.2
+qudida==0.0.4
+raft-dask==23.8.0
+rasterio==1.3.10
+rasterstats==0.19.0
+ray-cpp==2.9.0
+ray==2.9.0
+referencing==0.32.1
+regex==2023.12.25
+requests-oauthlib==1.3.1
+requests-toolbelt==0.10.1
+requests==2.31.0
+retrying==1.3.3
+retrying==1.3.4
+rfc3339-validator==0.1.4
+rfc3986-validator==0.1.1
+rgf-python==3.12.0
+rich-click==1.7.4
+rich==13.7.0
+rich==13.7.1
+rmm==23.8.0
+rope==1.13.0
+rpds-py==0.16.2
+rsa==4.9
+ruamel-yaml-conda==0.15.100
+ruamel.yaml.clib==0.2.7
+ruamel.yaml==0.17.40
+s2sphere==0.2.5
+s3fs==2024.2.0
+s3transfer==0.6.2
+safetensors==0.4.3
+scattertext==0.1.19
+scikit-image==0.22.0
+scikit-learn-intelex==2024.3.0
+scikit-learn==1.2.2
+scikit-multilearn==0.2.0
+scikit-optimize==0.10.1
+scikit-plot==0.3.7
+scikit-surprise==1.1.3
+scipy==1.11.4
+scipy==1.13.0
+seaborn==0.12.2
+segment_anything==1.0
+segregation==2.5
+semver==3.0.2
+sentencepiece==0.2.0
+sentry-sdk==1.45.0
+setproctitle==1.3.3
+setuptools-git==1.2
+setuptools-scm==8.0.4
+setuptools==69.0.3
+shap==0.44.1
+shapely==2.0.4
+shellingham==1.5.4
+simpervisor==1.0.0
+simplejson==3.19.2
+six==1.16.0
+sklearn-pandas==2.2.0
+slicer==0.0.7
+smart-open==6.4.0
+smmap==5.0.1
+sniffio==1.3.0
+snowballstemmer==2.2.0
+snuggs==1.4.7
+sortedcontainers==2.4.0
+soundfile==0.12.1
+soupsieve==2.5
+soxr==0.3.7
+spacy-legacy==3.0.12
+spacy-loggers==1.0.5
+spacy==3.7.3
+spaghetti==1.7.5.post1
+spectral==0.23.1
+spglm==1.1.0
+sphinx-rtd-theme==0.2.4
+spint==1.0.7
+splot==1.1.5.post1
+spopt==0.6.0
+spreg==1.4.2
+spvcm==0.3.0
+sqlparse==0.4.4
+squarify==0.4.3
+srsly==2.4.8
+stable-baselines3==2.1.0
+stack-data==0.6.2
+stack-data==0.6.3
+stanio==0.5.0
+starlette==0.32.0.post1
+statsmodels==0.14.1
+stemming==1.0.1
+stop-words==2018.7.23
+stopit==1.1.2
+stumpy==1.12.0
+sympy==1.12
+tables==3.9.2
+tabulate==0.9.0
+tangled-up-in-unicode==0.2.0
+tbb==2021.12.0
+tblib==3.0.0
+tenacity==8.2.3
+tensorboard-data-server==0.7.2
+tensorboard-plugin-profile==2.15.0
+tensorboard==2.15.1
+tensorboardX==2.6.2.2
+tensorflow-cloud==0.1.16
+tensorflow-datasets==4.9.4
+tensorflow-decision-forests==1.8.1
+tensorflow-estimator==2.15.0
+tensorflow-hub==0.16.1
+tensorflow-io-gcs-filesystem==0.35.0
+tensorflow-io==0.35.0
+tensorflow-metadata==0.14.0
+tensorflow-probability==0.23.0
+tensorflow-serving-api==2.14.1
+tensorflow-text==2.15.0
+tensorflow-transform==0.14.0
+tensorflow==2.15.0
+tensorstore==0.1.56
+termcolor==2.4.0
+terminado==0.18.0
+testpath==0.6.0
+text-unidecode==1.3
+textblob==0.18.0.post0
+texttable==1.7.0
+tf_keras==2.15.1
+tfp-nightly==0.24.0.dev0
+thinc==8.2.2
+threadpoolctl==3.2.0
+tifffile==2023.12.9
+timm==0.9.16
+tinycss2==1.2.1
+tobler==0.11.2
+tokenizers==0.19.1
+toml==0.10.2
+tomli==2.0.1
+tomlkit==0.12.4
+toolz==0.12.1
+torch==2.3.0
+torchaudio==2.1.2
+torchdata==0.7.1
+torchinfo==1.8.0
+torchmetrics==1.3.2
+torchtext==0.16.2
+torchvision==0.16.2
+tornado==6.3.3
+tqdm==4.66.1
+traceml==1.0.8
+traitlets==5.9.0
+traittypes==0.2.1
+transformers==4.41.0
+treelite-runtime==3.2.0
+treelite==3.2.0
+triton==2.3.0
+truststore==0.8.0
+trx-python==0.2.9
+tsfresh==0.20.2
+typeguard==4.1.5
+typer==0.9.0
+typer==0.9.4
+types-python-dateutil==2.8.19.20240106
+typing-inspect==0.9.0
+typing-utils==0.1.0
+typing_extensions==4.9.0
+tzdata==2023.4
+uc-micro-py==1.0.3
+ucx-py==0.33.0
+ujson==5.9.0
+umap-learn==0.5.6
+unicodedata2==15.1.0
+update-checker==0.18.0
+uri-template==1.3.0
+uritemplate==3.0.1
+urllib3==1.26.18
+urllib3==2.1.0
+urwid==2.6.10
+urwid_readline==0.14
+uvicorn==0.25.0
+uvloop==0.19.0
+vaex-astro==0.9.3
+vaex-core==4.17.1
+vaex-hdf5==0.14.1
+vaex-jupyter==0.8.2
+vaex-ml==0.18.3
+vaex-server==0.9.0
+vaex-viz==0.5.4
+vaex==4.17.0
+vec_noise==1.1.4
+vecstack==0.4.0
+virtualenv==20.21.0
+visions==0.7.5
+vowpalwabbit==9.9.0
+vtk==9.3.0
+wandb==0.16.6
+wasabi==1.1.2
+watchfiles==0.21.0
+wavio==0.0.8
+wcwidth==0.2.13
+weasel==0.3.4
+webcolors==1.13
+webencodings==0.5.1
+websocket-client==1.7.0
+websockets==12.0
+wfdb==4.1.2
+whatthepatch==1.0.5
+wheel==0.42.0
+widgetsnbextension==3.6.6
+witwidget==1.8.1
+woodwork==0.30.0
+wordcloud==1.9.3
+wordsegment==1.3.1
+wrapt==1.14.1
+xarray-einstats==0.7.0
+xarray==2024.3.0
+xformers==0.0.26.post1
+xgboost==2.0.3
+xvfbwrapper==0.2.9
+xxhash==3.4.1
+xyzservices==2024.4.0
+y-py==0.6.2
+yapf==0.40.2
+yarl==1.9.3
+yarl==1.9.4
+ydata-profiling==4.6.4
+yellowbrick==1.5
+ypy-websocket==0.8.4
+zict==3.0.0
+zipp==3.17.0
+zstandard==0.22.0
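
Several packages in the captured requirements.txt are pinned more than once with different versions (for example `MarkupSafe`, `cachetools`, and `Pillow`/`pillow`). This probably reflects more than one install location in the Kaggle image, but the file itself does not say so. A minimal sketch for listing such entries follows. The helper name `conflicting_pins` is hypothetical, and the sample lines are a small subset copied from the diff above.

```python
import re
from collections import defaultdict


def normalize(name):
    # Normalize a distribution name the way PEP 503 describes:
    # lowercase, with runs of '-', '_' and '.' collapsed to a single '-'.
    return re.sub(r"[-_.]+", "-", name).lower()


def conflicting_pins(lines):
    # Hypothetical helper: map each normalized package name to the set of
    # versions pinned for it, and keep only names with more than one version.
    versions = defaultdict(set)
    for line in lines:
        entry = line.strip().lstrip("+")  # drop the diff marker, if present
        if "==" not in entry:
            continue
        name, version = entry.split("==", 1)
        versions[normalize(name)].add(version)
    return {name: sorted(found) for name, found in versions.items() if len(found) > 1}


# A few lines copied from the requirements.txt diff above.
sample = [
    "+MarkupSafe==2.1.3",
    "+MarkupSafe==2.1.5",
    "+Pillow==9.5.0",
    "+pillow==10.3.0",
    "+numpy==1.26.4",
]

conflicts = conflicting_pins(sample)
for package, pinned in conflicts.items():
    print(package, pinned)
```

The versions are sorted as plain strings, which is enough for spotting duplicates but is not a proper version ordering.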

wandb/run-20240522_113413-8mudzhjp/files/wandb-metadata.json ADDED

	@@ -0,0 +1,62 @@

+{
+    "os": "Linux-5.15.133+-x86_64-with-glibc2.31",
+    "python": "3.10.13",
+    "heartbeatAt": "2024-05-22T11:34:14.876249",
+    "startedAt": "2024-05-22T11:34:13.994936",
+    "docker": null,
+    "cuda": null,
+    "args": [],
+    "state": "running",
+    "program": "kaggle.ipynb",
+    "codePathLocal": null,
+    "root": "/kaggle/working",
+    "host": "2c1b614ec68f",
+    "username": "root",
+    "executable": "/opt/conda/bin/python3.10",
+    "cpu_count": 2,
+    "cpu_count_logical": 4,
+    "cpu_freq": {
+        "current": 2000.144,
+        "min": 0.0,
+        "max": 0.0
+    },
+    "cpu_freq_per_core": [
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        },
+        {
+            "current": 2000.144,
+            "min": 0.0,
+            "max": 0.0
+        }
+    ],
+    "disk": {
+        "/": {
+            "total": 8062.387607574463,
+            "used": 5656.421318054199
+        }
+    },
+    "gpu": "Tesla P100-PCIE-16GB",
+    "gpu_count": 1,
+    "gpu_devices": [
+        {
+            "name": "Tesla P100-PCIE-16GB",
+            "memory_total": 17179869184
+        }
+    ],
+    "memory": {
+        "total": 31.357563018798828
+    }
+}

wandb/run-20240522_113413-8mudzhjp/files/wandb-summary.json ADDED

	@@ -0,0 +1 @@


+{"train/loss": 3.5063, "train/grad_norm": 2.443696975708008, "train/learning_rate": 0.0, "train/epoch": 2.0, "train/global_step": 2000, "_timestamp": 1716386681.8685813, "_runtime": 9027.865963220596, "_step": 226, "train_runtime": 7028.5663, "train_samples_per_second": 0.285, "train_steps_per_second": 0.285, "total_flos": 1.046896491923424e+16, "train_loss": 3.840830388069153}
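
The throughput figures in this summary can be cross-checked against each other. The sketch below assumes only the values shown in the summary line above (a subset of keys is copied into `summary_text`); the variable names are illustrative.

```python
import json

# Subset of the wandb-summary.json record shown above.
summary_text = (
    '{"train/global_step": 2000, "train/epoch": 2.0, '
    '"train_runtime": 7028.5663, '
    '"train_samples_per_second": 0.285, "train_steps_per_second": 0.285}'
)
summary = json.loads(summary_text)

steps = summary["train/global_step"]
runtime_s = summary["train_runtime"]

# Optimizer steps per second, recomputed from the raw totals.
steps_per_second = steps / runtime_s

# Average wall-clock time per optimizer step.
seconds_per_step = runtime_s / steps

# Samples per second divided by steps per second gives the effective
# number of samples consumed per optimizer step.
effective_batch = (
    summary["train_samples_per_second"] / summary["train_steps_per_second"]
)

# Steps per epoch, from the final step count and epoch count.
steps_per_epoch = steps / summary["train/epoch"]

print(f"steps/s:       {steps_per_second:.3f}")
print(f"s/step:        {seconds_per_step:.2f}")
print(f"samples/step:  {effective_batch:.1f}")
print(f"steps/epoch:   {steps_per_epoch:.0f}")
```

Samples and steps per second are equal, so the run appears to have processed one sample per step. The logged config sets `per_device_train_batch_size` to 8 with `auto_find_batch_size` enabled, so one possible explanation is that the batch size was reduced automatically, though the summary alone does not confirm this.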

wandb/run-20240522_113413-8mudzhjp/logs/debug-internal.log ADDED

The diff for this file is too large to render.

wandb/run-20240522_113413-8mudzhjp/logs/debug.log ADDED

	@@ -0,0 +1,54 @@

+2024-05-22 11:34:13,996 INFO    MainThread:217 [wandb_setup.py:_flush():76] Current SDK version is 0.16.6
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Configure stats pid to 217
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_setup.py:_flush():76] Applying login settings: {}
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_init.py:_log_setup():521] Logging user logs to /kaggle/working/wandb/run-20240522_113413-8mudzhjp/logs/debug.log
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_init.py:_log_setup():522] Logging internal logs to /kaggle/working/wandb/run-20240522_113413-8mudzhjp/logs/debug-internal.log
+2024-05-22 11:34:13,997 INFO    MainThread:217 [wandb_init.py:_jupyter_setup():467] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7ef92390cee0>
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():561] calling init triggers
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():568] wandb.init called with sweep_config: {}
+config: {}
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():611] starting backend
+2024-05-22 11:34:13,998 INFO    MainThread:217 [wandb_init.py:init():615] setting up manager
+2024-05-22 11:34:14,000 INFO    MainThread:217 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2024-05-22 11:34:14,002 INFO    MainThread:217 [wandb_init.py:init():623] backend started and connected
+2024-05-22 11:34:14,016 INFO    MainThread:217 [wandb_run.py:_label_probe_notebook():1299] probe notebook
+2024-05-22 11:34:14,540 INFO    MainThread:217 [wandb_init.py:init():715] updated telemetry
+2024-05-22 11:34:14,544 INFO    MainThread:217 [wandb_init.py:init():748] communicating run to backend with 90.0 second timeout
+2024-05-22 11:34:14,778 INFO    MainThread:217 [wandb_run.py:_on_init():2357] communicating current version
+2024-05-22 11:34:14,843 INFO    MainThread:217 [wandb_run.py:_on_init():2366] got version response upgrade_message: "wandb version 0.17.0 is available!  To upgrade, please run:\n $ pip install wandb --upgrade"
+2024-05-22 11:34:14,844 INFO    MainThread:217 [wandb_init.py:init():799] starting run threads in backend
+2024-05-22 11:34:30,856 INFO    MainThread:217 [wandb_run.py:_console_start():2335] atexit reg
+2024-05-22 11:34:30,857 INFO    MainThread:217 [wandb_run.py:_redirect():2190] redirect: wrap_raw
+2024-05-22 11:34:30,857 INFO    MainThread:217 [wandb_run.py:_redirect():2255] Wrapping output streams.
+2024-05-22 11:34:30,857 INFO    MainThread:217 [wandb_run.py:_redirect():2280] Redirects installed.
+2024-05-22 11:34:30,858 INFO    MainThread:217 [wandb_init.py:init():842] run started, returning control to user process
+2024-05-22 11:34:30,865 INFO    MainThread:217 [wandb_run.py:_config_callback():1347] config_cb None None {'peft_config': {'default': {'peft_type': <PeftType.LORA: 'LORA'>, 'auto_mapping': None, 'base_model_name_or_path': 'core42/jais-13b', 'revision': None, 'task_type': 'CAUSAL_LM', 'inference_mode': False, 'r': 16, 'target_modules': {'c_attn'}, 'lora_alpha': 32, 'lora_dropout': 0.05, 'fan_in_fan_out': False, 'bias': 'none', 'use_rslora': False, 'modules_to_save': None, 'init_lora_weights': True, 'layers_to_transform': None, 'layers_pattern': None, 'rank_pattern': {}, 'alpha_pattern': {}, 'megatron_config': None, 'megatron_core': 'megatron.core', 'loftq_config': {}, 'use_dora': False, 'layer_replication': None}}, 'vocab_size': 84992, 'n_positions': 2048, 'n_embd': 5120, 'n_layer': 40, 'n_head': 40, 'n_inner': 13653, 'activation_function': 'swiglu', 'resid_pdrop': 0.0, 'embd_pdrop': 0.0, 'attn_pdrop': 0.0, 'layer_norm_epsilon': 1e-05, 'initializer_range': 0.02, 'scale_attn_weights': True, 'use_cache': False, 'scale_attn_by_inverse_layer_idx': False, 'reorder_and_upcast_attn': False, 'bos_token_id': 0, 'eos_token_id': 0, 'position_embedding_type': 'alibi', 'width_scale': 0.11100000000000002, 'embeddings_scale': 14.6, 'scale_qk_dot_by_d': True, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 
'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['JAISLMHeadModel'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'pad_token_id': 0, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'core42/jais-13b', 'transformers_version': '4.41.0', 'auto_map': {'AutoConfig': 'core42/jais-13b--configuration_jais.JAISConfig', 'AutoModel': 'core42/jais-13b--modeling_jais.JAISModel', 'AutoModelForCausalLM': 'core42/jais-13b--modeling_jais.JAISLMHeadModel', 'AutoModelForQuestionAnswering': 'core42/jais-13b--modeling_jais.JAISForQuestionAnswering', 'AutoModelForSequenceClassification': 'core42/jais-13b--modeling_jais.JAISForSequenceClassification', 'AutoModelForTokenClassification': 'core42/jais-13b--modeling_jais.JAISForTokenClassification'}, 'model_type': 'jais', 'quantization_config': {'quant_method': 'QuantizationMethod.BITS_AND_BYTES', '_load_in_8bit': False, '_load_in_4bit': True, 'llm_int8_threshold': 6.0, 'llm_int8_skip_modules': None, 'llm_int8_enable_fp32_cpu_offload': False, 'llm_int8_has_fp16_weight': False, 'bnb_4bit_quant_type': 'nf4', 'bnb_4bit_use_double_quant': False, 'bnb_4bit_compute_dtype': 'bfloat16', 'bnb_4bit_quant_storage': 'uint8', 'load_in_4bit': True, 'load_in_8bit': False}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': False, 'do_predict': False, 'eval_strategy': 'no', 'prediction_loss_only': False, 'per_device_train_batch_size': 8, 'per_device_eval_batch_size': 8, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 
'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.0, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 2, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 0, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/May22_11-33-56_2c1b614ec68f', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': 4, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': True, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': False, 'metric_for_best_model': None, 'greater_is_better': None, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 
'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': True, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False}
+2024-05-22 11:34:30,875 INFO    MainThread:217 [wandb_config.py:__setitem__():151] config set model/num_parameters = 13033919160 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x7ef9227a9060>>
+2024-05-22 11:34:30,876 INFO    MainThread:217 [wandb_run.py:_config_callback():1347] config_cb model/num_parameters 13033919160 None
+2024-05-22 14:04:41,874 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:04:41,875 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:14:52,958 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:14:54,437 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:14:54,437 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:15:26,186 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:16:25,347 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:16:25,348 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:16:29,691 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:16:44,749 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:16:44,749 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:23:14,136 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:23:16,353 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:23:16,353 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:26:18,732 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:26:19,623 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:26:19,624 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:34:00,493 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend
+2024-05-22 14:34:00,984 INFO    MainThread:217 [jupyter.py:save_ipynb():373] not saving jupyter notebook
+2024-05-22 14:34:00,984 INFO    MainThread:217 [wandb_init.py:_pause_backend():432] pausing backend
+2024-05-22 14:34:16,410 INFO    MainThread:217 [wandb_init.py:_resume_backend():437] resuming backend

wandb/run-20240522_113413-8mudzhjp/run-8mudzhjp.wandb ADDED

Binary file (492 kB).